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1. Introduction 



In this thesis we consider a matrix variate extension of the Cox-Ingersoll-Ross process 
Coxet aLlp85] ), i.e. a solution of a one-dimensional stochastic differential equation 



(see 

of the form 

dS t = ay/StdB t + a(b- S t )dt, S = s eR (1.1) 

with positive numbers a, b, a and a one-dimensional Brownian motion B. Our extension is 
defined by a solution of the p x p-dimensional stochastic differential equation of the form 

dS t = y/S~ t dB t Q + Q T dBj^/St+ (S t K + K T S t + aQ T Q) dt, S = s G M P (R) (1.2) 

where Q and K are real valued p x p-matrices, a a non-negative number and B a p x 
p-dimensional Brownian motion (that is, a matrix of p 2 independent one-dimensional 
Brownian motions). 

While it is well-known that solutions of (1.1), called CIR processes, always exist, the 
situation for (1.2) is more difficult. As we will see in this thesis, it is crucial to choose the 



parameter a in the right way, to guarantee the (strong) existence of unique solutions of 



(1.2), that are then called Wishart processes. We will derive that it is sufficient to choose 



the parameter a larger or equal to p + 1. That is similar to a result given by Bru 1991 



The characteristic fact of (1.1 ) is that this process remains positive for a certain choice 



of b. This makes it suitable for modeling for example an interest rate, which should always 
be positive because of economic reasons. Hence, this is an approach for pricing bonds (see 
the section 5.1). If we want to consider some (corporate) bonds jointly, e.g. because they 
are correlated, the need for a multidimensional extension comes up. See section 5.3 of this 
thesis or Gourieroux 2007, 3.5.2.] for a discussion of this topic. For the Wishart processes, 



we have in the case a > p + 1 that the unique solution of (1.2) remains positive definite 
for all times. 

Another well-known fact is that the conditional(on s ) distribution of the CIR process at a 
certain point in time is noncentral chi-square. We will see that the conditional distribution 
of the Wishart process St at time t is a matrix variate extension of the noncentral chi- 
square distribution, that is called noncentral Wishart distribution. 



An application for matrix variate stochastic processes can be found in Fonseca et al. 



2008 . which model the dynamics of a p risky assets X by 

dX t = diag(X t )[(rl + X t ) dt + ^S t dW t ] (1.3) 

where r is a positive number, 1 = (1, . . . , 1) e W, X t a p-dimensional stochastic process, 
interpreted as the risk premium, and Z a p-dimensional Brownian motion. The volatility 



process S is the Wishart process of (1.2). 
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1. Introduction 



In chapter 2, we introduce some notations and give the necessary background for under- 
standing the following chapters. In chapter 3, a review of fundamental terms and results 
of matrix variate stochastics and the theory of stochastic differential equations is given, 
and in section 3.5 some results are derived concerning a matrix variate extension of the 
Ornstein-Uhlenbeck processes. The main work on the theory of Wishart processes will be 
done in chapter 4, where we give a general theorem about the existence and uniqueness of 



Wishart processes in section 4.2. In section 4.3, we show that some soutions of (1.2) can 



be expressed in terms of the matrix variate Ornstein-Uhlenbeck process from section 3.5 
and in section 4.4 we give an algorithm to simulate Wishart processes. Finally, we give an 
outlook on the applications of Wishart processes in mathematical finance in chapter 5. 



For further readings about Wishart processes, one could have a look at the paper of 
Bru 1991 and for more financial applications, one could consider Gourieroux 2007 or 
Fonseca et al. 2008 , for example. 
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2. Preliminaries 



In this section we summarize the technical prerequisites which are necessary for matrix 
variate stochastics. 

2.1. Matrix Algebra 

Definition 2.1. 

(i) Denote by .M m , n (R) the set of all m x n matrices with entries in R. 
If m = n, we write A4 n (M) instead. 

(ii) Write GL{p) for the group of all invertible elements ofA4 p (M). 

(Hi) Let S p denote the linear subspace of all symmetric matrices ofA4 p (M). 

(iv) Let (S~ ) denote the set of all symmetric positive (negative) definite matrices of 
M P (R). 

(v) Denote by <S+ the closure of Sp in A4 P (M), that is the set of all symmetric positive 
semidefinite matrices of A4 P (R). 

Let us review some characteristics of positive (semi) definite matrices. 

Theorem 2.2 (Positive definite matrices). 

(i) AeS+ if and only ifx T Ax > Vi G W : x ^ 

(ii) AeS+ if and only ifx T Ax > Vi G W : | \x\ \ = 1 

(Hi) A G Sp if and only if A is orthogonally diagonalizable with positive eigenvalues, i.e. 
there exists an orthogonal matrix U G A4 P (M), UU T = I p , such that A = UDU T 
with a diagonal matrix D = diag(Ai, . . . , \ p ), where Aj > 0, i = 1, . . . , p, are the 
positive eigenvalues of A 

(w) If AeS+ then A~ l G S+ 

(v) Let A G Sp and B G A4 q;P (W) with q < p and rank r. Then BAB T G <S+ and, if B 
has full rank, i.e. r = q, then BAB T is even positive definite, BAB T G . 

(vi) A^A G S+ for all A G GL{p) 

(vn) AeSj if and only ifx T Ax >0Vx6M p :i^0 
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2. Preliminaries 



(viii) A G 5+ if and only if A is orthogonally diagonalizable with non-negative eigenvalues 
(ix) M T M G S+ for all M G M % 



Proof. See Muirhead 2005 , Appendix A8] or Fischer 2005 . □ 

On (5+ we are able to define a matrix valued square root function by 

Definition 2.3 (Square root of posi tive semidefinite matrices). 
Let A G <S+. According to Theorem 



2.2 



there exists an orthogonal matrix U G Ai p (M.), 
UU T = I p , such that A = UDU T with D = diag(Ai, . . . , A p ), where Xi > ; % — 1, . . . ,p. 
Then we define the square root of A by \/A = C/diag(\/Ai, . . . , y / A^)t/ T , what is a positive 
semidefinite matrix, too. 



The square root y A is well-defined and independent of U, as can be seen in Fischer 



2005 , for example. 

Remark 2.4. If A e S+ then v^A G <S+ . 

Now, we talk about the differentiation of matrix valued functions. 

Definition 2.5. Let S G A / f p (lR). We define the differential operator D = f or a ^ 

real valued, differentiable functions f : A^ P (M) — > K as the matrix of all partial derivations 



9§-f(S)off(S). 



The following calculation rules are going to be helpful in the next chapters, so we state 
them here. In order not to lengthen this chapter, we only give outlines of the proofs. 

Lemma 2.6 (Calculation rules for determinants). 

For all A,B G M P (R), S G GL(p) and H t :R-> GL(p) differentiable it holds that 

(i) det(AB) = det(A) det(B) and det(aA) = a p det(A) Va G E. 

(ii) If Ae GL(p) orBe GL(p) then det(J p + AB) = det(I p + BA) 
(m) idet(H t ) = det(H t )tr(H^£ t H t ) 

(w) D(det(S)) =det(5)(5- 1 ) T 

(v) det(v4) is the product of the eigenvalues of A. 
If S is furthermore symmetric then 

(vi) £>(det(S)) = det(5)5- 1 

(™) as$^M< S )) = det(S)[(S-i) kl (S-% - (S-i) ik (S-%\ 
where (S^ 1 )^ denotes the i,j-th entry of S~ x 
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2. 1 . Matrix Algebra 



Proof, (i) and (v) can be found in every Linear Algebra book as e.g. in Fischer 2005 , 

(ii) follows from (i). 

(iii) can be proven using the Laplace expansion and the property adj(A) = det(A)A~ 1 for 
the adjugate matrix adj(A). 

(iv) follows by using the Leibniz formula for determinants and again the adjugate property, 
(vi) is just a special case of (iv). 

Finally, (vii) follows from (vi) using J^£>(det(S)) = det^IOS"" 1 )*^" 1 + q^S' 1 ] and 



-s- 1 = -S 



-i/ a 



Lemma 2.7 (Calculation rules for trace). For all A,B, S G A4 P (R) it holds that 

(i) tr(AB) = tv(BA) and tv(aA + B) = atv(A) + tr(B) VaGM 

(ii) tr(A) is the sum of the eigenvalues of A. 



□ 



Proof See |Fischer| [2005] . □ 
Lemma 2.8 (Calculation rules for adjugate matrices). For all A £ GL(p) it holds that 

(i) adj(A) = det{A)A- 1 

(ii) tr(adj(A)) = det(A)tr(A" 1 ) 

Proof. For (i) see |Fischer| |2005] , (ii) is a trivial consequence of (i). □ 

The next Definition gives us a one-to-one relationship between vectors and matrices. 
The idea is the following: Suppose we want to transfer a theorem that holds for multi- 
variate stochastic processes to one that holds for matrix variate stochastic processes S. 
Then we can apply the theorem to vec(5') and, if necessary, apply vec -1 to the resulting 
'multivariate processes' to get the theorem in a matrix variate version. 



Definition 2.9. Let A G Ai m ^ n (M.) with columns G 
function vec : M m ,n{^) — > ^ mn via 



1, . . . , n. Define the 



vec(A) 



\a n J 



Sometimes, we will also consider vec(A) as an element of A4 mn ^( 
Remark 2.10. 

vec(A T ) = : 

where aj G lR n , j = 1, . . . , m denote the rows of A. 
Lemma 2.11. (Cf. \Gupta and Nagar\l200^ [Theorem 1.2. 
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2. Preliminaries 

(i) For A,B E M m , n (R) it holds that tr(A T B) = vec(A) T vec(B) 

(ii) Let A G M Pt m(M), B G M m , n (R) and C G M n , q (R). Then we have 

vec(AXB) = (B T ® A)vec(X) 

Before we continue, we introduce a new notation: For every linear operator A on a finite 
dimensional space we denote by o~(A) the spectrum of A, that is the set of all eigenvalues 
of A. 

Lemma 2.12. Let A G Ai p (M.) be a matrix such that ^ cr(A) + o~(A). Define the linear 
operator 

A:S P ^S P ,X^AX + XA T 
Then the inverse of A is given by 

A' 1 = vec -1 o (I p g) A + A <g> I P Y l o vec 



Proof. Can be shown with Lemma 2.11 (ii), for details see Stelzer 2007, p. 



□ 



For symmetric matrices there also exists another operator which transfers a matrix into 
a vector. 



Definition 2.13. (Cf. \Gupta and Nagar\ \200(\ Definition 1.2.7.]) Let S G S p . Define the 
function vech : S p — > M 2- ^ via 



( Su \ 

Sl2 
S22 



vech(S') 



S 



ip 



such thatvech(S) is a vector consisting of the elements of S from above and including the 
diagonal, taken columnwise. 



Compared to vec, the operator vech only takes the sis±ll distinct elements of a sym- 
metric p x p-matrix. 
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2.2. Some Functions of Matrices 



2.2. Some Functions of Matrices 

In this section we give a brief introduction to matrix variate functions that arise in the 
probability density function of the noncentral Wishart distribution. 



Definition 2.14. (Borel-o -algebra, cf. Jacod and Protter '2004, V- 48]) Let (X,T) be a 



topological space. The Borel-o -algebra on X is then given by the smallest a-algebra that 
contains T and will be denoted by B(X). 

If we work with the spaces R, R n or .M m , n (R) we assume that they have the natural 
topology, which is the set of all possible unions of open balls, given the euclidean metric. 
See B.v.Querenburg |2001 , p. 22] for details on topological spaces. We write B instead of 



B(R), B n instead of B(R n ) and B m ' n instead of B(M mjTl (R)). 

Definition 2.15 (Integration). Let f : A4 m:n (K) — >■ R be a B m,n -B -measurable function 
and M e B" 1 '' 1 a measurable subset o/.M mjn (R) and let A denote the Lebesgue-measure 
on (R mn , B mn ). The integral of f over M is then defined by 

[ f{X)dX:= [ f(X)d(\ ovec)(X) = [ f o vec _1 (x) dX(x) 

JM JM Jvec(M) 



We call A o vec the Lebesgue-measure on (A4 m , n (R), B m ' n ). 

p(p+i) 

Because S p is isomorphic to K a we know that for p > 2 that S p is a real subspace 
of j\Ap(JRL) and hence a A o vec-Null set. Thus, we define another Lebesgue measure on the 
subspace of all symmetric matrices S p . This integral is always meant if we integrate on a 
subset of S p , like e.g. S p , because an integral w.r.t. A o vec would always be zero. 

Definition 2.16 (Integration II). Let f : S p — )■ R be a B(S P )-B -measurable function and 
M G B{S p ) a Borel measurable subset of S p and let A denote the Lebesgue-measure on 
(]R p(P 2 + ) j B P(P 2 "). The integral of f over M is then defined by 

f(X)dX:= [ f(X)d(\ovech)(X) = [ f o vech" 1 ^) d\(x) 

JM Jvech(M) 

We call A o vech the Lebesgue-measure on (S P ,B(S P )). 



The following definition is just for convenience and can also be found in Gupta and 



Nagar 2000 



Definition 2.17. For A e M P (R) define etr(A) := e tv{A) . 
Definition 2.18 (Matrix Variate Gamma Function). 



T p (a) := I etr(-A) det(A) a -^ p+l) dA V a > - — - 
Js+ 2 
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2. Preliminaries 



Gupta and Nagar 



2000, Theorem 1.4.1.] show that, for a > the matrix variate 
gamma function can be expressed as a finite product of ordinary gamma functions. Thus, 
we do not need to worry about the existence of the matrix variate gamma function. 

The definition of the Hypergeometric Function is a little bit cumbersome and needs 
further explanations. Like in Muirhead 2005 by a symmetric homogeneous polynomial 
of degree k in y%, . . . , y m we mean a polynomial which is unchanged by a permutation of 
the subscripts and such that every term in the polynomial has degree k. Denote by Vk 
the space of all symmetric homogeneous polynomials of degree k in the p(j> ^ distinct 
elements of 5* e <Sj\ Then, ti(S) k = (Sn + • • • + S pp ) k is an element of According to 



Gupta and Nagar 2000 Vk can be decomposed into a direct sum of irreducible invariant 



subspaces V K where k is a partition of k. With a partition k of k we mean a p-tuple 
k = (ki, . . . ,k p ) such that k\ > . . . > k p > and k\ + . . . + k p = k. 

Definition 2.19 (Zonal Polynomials). The zonal polynomial C K (S) is the component of 
tr(S) k in the subspace V K . 



The Definition implies that tr(S) k = C K (S) (according to Gupta and Nagar 
Finally, we are able to state 



2000 



)• 



Definition 2.20 (Hypergeometric Function of matrix argument). 
The Hypergeometric Function of matrix argument is defined by 



i_F n (ai, . . . , a m ; bi, . . . , 6„; S) — 



ai 



k=0 k 



(pi) K ■ ■ ■ (b n ) K k\ 



(2.1) 



where a^, bj el, S is a symmetric p x p-matrix and ^ K the summation over all partitions 
k ofk and (a) K = YYj=i ( a ~ \{j ~ ^))k denotes the generalized hypergeometric coefficient, 
with = x(x + 1) • • • (x + kj — 1). See Gupta and Nagar ^2000, p. 30]. 



Gupta and Nagar |2000 p. 34] discuss conditions for the convergence and thus well- 



definedness of (2.1). A sufficient condition is m < n+ 1. 
Remark 2.21. n F n {a Xl . . . , a n ; a u . . . , a n ; S) = Y,T=o 



etr(S) 



The following Lemma eases later on the calculation of expectations of functions of 
noncentral Wishart distributed random matrices. 

Lemma 2.22. Let Z,T eS+. Then 



etv(-ZS) det(S') a 2 m F n (ai, . . . , a m ; b 1: . . . , b n ; ST) dS 
= T p (a) det(Z)" a m+ iF n (ai, . 



ffl mi o; bi, . . . , b n ; Z l T) 



Va > 2=1. 



Proof. This is a special case of Gupta and Nagar 2000, Theorem 1.6.2] 



□ 
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3. Matrix Variate Stochastics 



Definition 3.1. A quadruple (Q,G,(Gt)tm + ,Q) is called a filtered probability space if 
Q is a set, Q is a a -field on fl, (Gt)teM+ is an increasing family of sub-cr-fields of Q (a 
filtration j and Q is a probability measure on The filtered probability space is said 

to satisfy the usual conditions if 

(i) the filtration is right continuous, i.e. f] s>t G s = Gt for every t > 0, and 

(H) if (Gt)tm+ is complete, i.e. Go contains all sets from G having Q-probability zero. 

Throughout this thesis we assume (fl, J 7 , (J-t)teM. + , P) to be a filtered probability space 
satisfying the usual conditions. 



3.1. Distributions 



Now we summarize a few results and definitions from Gupta and Nagar 2000 



Definition 3.2 (Random Matrix). A m x n-random matrix X is a measurable function 
X:(n,F)^(M m , n (R),Br- 



Definition 3.3 (Probability Density Function). A nonnegative measureable function fx 
such that 



P(X e M) = / f x (A)dA VM e B mxn 

J M 

defines the probability density function(p.d.f.) of a m x n-random matrix X. 



Recall that the Radon-Nikodym theorem (see Jacod and Protter 2004, Theorem 28.3]) 
says that the existence of a p.d.f. of X is equivalent to saying that the distribution P x , 
of X under P, is absolutely continuous w.r.t the Lebesgue-measure on (M. m>n (R), B m,n ). 

Definition 3.4 (Expectation). Let X be a m x n-random matrix. For every function 
h = (h^ij : M m , n (R) ->■ -M r , s (R) with : M m , n (R) -)> E, 1 < i < r, 1 < j < s, the 
expected value E(h(X)) of h(X) is an element of J^A r ^{R) with elements 

E(h(X)) l3 = E(h l3 (X)) = [ h l3 \A) P x (dA) 

As a matter of fact, E(h(X))ij = J M hij(A)f x (A) dA if X has p.d.f. fx- 
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3. Matrix Variate Stochastics 



Definition 3.5 (Characteristic Function). 

The characteristic function of a m x n-random matrix X with p.d.f. fx is defined by 



E[etr(iXZ T )]= [ eti(iAZ T )f x (A)dA (3.1) 



' M 

for every Z G AA P (M). 
Remark 3.6. 

• Because of \exp(ix)\ = 1 Vi G R the above integral always exists. 



• If the distribution of X under P is denoted by P x , then (3.1) is the Fourier trans- 
form of the measure P x at point Z and will be denoted by P X (Z). 

From now on, with the term "X in 5+ is a random matrix" we mean that X is a 
random matrix with X(u) G «S+ for almost all oj G Q. 

Definition 3.7 (Laplace transform). 

The Laplace transform of a p x p-random matrix X in Sp with p.d.f. fx is defined by 

E[etr(-UX)} = I etr(-C7 'A) f X (A) dA (3.2) 
Js+ 

for every U G <S+ 
Remark 3.8. 

• From Lemma 2.11 we see that ti(UA) = vec(U) T vec(A) represents an inner product 



on M., 



For A,U G <S+ we have that tr(-UA) = -tr(\/UAVU) < 0, because y/UAy/U 



IS 



positive definite. Thus, the integral in (3.2) is well-defined. 



Basically, Levy's Continuity Theorem says that weak convergence of probability mea- 
sures is equivalent to the pointwise convergence of their respective Fourier transforms. 

Theorem 3.9 (Levy's Continuity Theorem). Let (fi n ) n >i be a sequence of probability 
measures on Ai p (M), and let {p^) n >i denote their Fourier transforms. 

(i) If [i n converges weakly to a probability measure fi, fi n w ^ \i, then jl^(Z) j2(Z) 
pointwise for all Z G Ai p (R). 

(ii) If f : M P (R) -)■ R is continuous at and fTn(Z) f(Z) for all Z G A^ P (R) ; 
then there exists a probability measure fi on Ai p (M) such that f(Z) = fi(Z), and 

weak 
fl n -> fl. 



Proof. See Jacod and Protter 2004, Theorem 19.1] for a proof of a multivariate version 



that can be extended to the matrix variate case easily. □ 



10 



3.1. Distributions 



With the term analytic function we mean a function that is locally given by a convergent 
power series. 



Lemma 3.10. ( Cf. \ Jurek and Mason '1993, Lemma 3.8.4-]) 

Let f be a real-valued analytic funtion on Ai p (M) which is not identically equal to zero. 
Then the set {x G Ai p (R) : f(x) = 0} has p 2 -dimensional Lebesgue measure zero. 

Definition 3.11 (Covariance Matrix). Let X be a m x n-random matrix and Y be a 
p x q-random matrix. The mn x pq covariance matrix is defined as 

cov(X,Y) = cov(vec(X T ),vec(y T )) 

= £[vec(X T )vec(Y" T ) T ] - £[vec(X T )]£[vec(Y" T )] T 

i.e. cov(X, Y) is a m x p block matrix with blocks cov(xj T , y~j T ) G Ai n!q (M.) where x~i (or 
resp. y~i) denote the rows of X (or resp. Y). 

Definition 3.12 (Matrix Variate Normal Distribution). A p x n-random matrix is said 
to have a matrix variate Normal distribution with mean M G A / f Pin (lR) and covariance 
£ <g) * where E 6 <S+ $ 6 S+, if vec(X T ) ~ A/" pn (vec(M T ), S <g> where J\f pn denotes 
the multivariate Normal distribution on W° n with mean vec(M T ) and covariance £ <S> 
We will use the notation X ~ N Pjn {M, E(8$). 

Lemma 3.13. (Cf. \Gupta and Na~g~al\ fMoty Theorem 2.3.1.]) If X ~ N p , n (M, £ ® 
ffcen X T ~ A/"n, P (M T , * ® E) 

Theorem 3.14 (Characteristic Function of the Matrix Variate Normal Distribution). 

Let X ~ J\f Pt1l (M, £ <8> T/ien £/ie characteristic function of X is given by 



E[etr(iXZ T )] = etr ^Z T M - ^Z T £ 



(3.3) 



Proof. Cf. Gupta and Nagar 2000 Theorem 2.3.2] 



□ 



If E and \I/ are not positive definite, but still positive semidefinite we will use (3.3) as 
a (generalized) definition for the matrix variate Normal distribution. 



Theorem 3.15. Let X ~ Af Pin (M, E g> A G .M TO , ff (R), 5eM 
C G .M„, ff (R). T/ien A + BXC ~ Af m>g (4 + BMC, (BJ:B T ) <g> (C T ^C)). 



and 



Proof. Follows from Theorem |3.14 

E[etr(i(A + 5XC)Z T )] = etr(zAZ T )E[etr(iX(CZ T J B))] 



etr^AZ^etr \iCZ L BM - -CZ BEB ZC ^ 
etr ( iZ T (A + BMC) - -Z r (BEB T )Z(C T ^C) 



□ 
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3. Matrix Variate Stochastics 



Definition 3.16 (Noncentral Wishart Distribution). A p x p-random matrix X in is 
said to have a noncentral Wishart distribution with parameters p G N, n > p, £ G S£ 
and G JA P (R) if its p.d.f is given by 



fx(S) 



(2>r p (^)det(S)f) _1 etr f-i(6 + E" 1 ^ det(S)^ n - p - 1 \F 1 



n 1 
2' 4 



where S G <S+ and o-Pi £s hyper geometric function. We write X ~ W p (n, E, G). 
Remark 3.17. 



(3.4) 



The requirement n > p assures that the matrix variate gamma function is well- 
defined. For the case p — 1 < n < p or resp. n G {1, . . . ,p — 1} one could use 
the Laplace transform (3.5) or resp. Lemma 3. 20\ to define the Wishart distribution 
for this case. However, Olkin and Rubin '1961, Appendix] have shown for non- 
integer n < p—1 that (3.5) is not the Laplace transform of a probability distribution 
anymore. 

If £ G S+\Sp the p.d.f. does not exist, but we can define the noncentral Wishart 



distribution using the characteristic function from Theorem 3.19 



If Q = 0, X is said to have (central) Wishart distribution with parameters p,n and 
S G Sp and its p.d.f. is given by 



2Wr p (£)det(E)i 



etr 



E^S j det(S)^ 



h(n-p-l) 



where S G and n > p (see Gupta and Nagar (2000, Definition 3.2.1]). 

Theorem 3.18 (Laplace Transform of the Noncentral Wishart Distribution). 

Let S ~ W p (n, E, 6). Then the Laplace transform of S is given by 

E(etr(-US)) = det(I„ + 2£[/)-tetr[-e(J p + 2£[/)- 1 £f/] 

with U eS+ 
Proof. 

E(eti-US) = 



(3.5) 



etr(-US)f s (S) dS 



2>r p (^)det(£) i 



-i 



etr 



-0 



x 



J + etr (-US - l^S^J det(S)^ n - p - 1) F 1 (J; hdE^S 



dS 



With Lemma [2T22] and Remark |2j5T we get 
■is 



det(^)i("-P- 1 )etr 

r 
r 



I n 1 

US--E- 1 S) F 1 {-;-QE- 1 S)dS 



, /TT 1 „ i . n „ ,n n 1 ,, r 
- 1 )-' 1 F l (-,- rl{ U 



( „ 2 )det((7+-E 



0£ 



pv|) det(^E- 1 (/ p + 2EC/))-t 1 F 1 (|, |; 1(1, + 2£f/)- 1 0) 
25™p r p (|) det(E)t det(/ p + 2£[/)-?etr(^(/ p + 2£[/)- 1 0) 
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3.2. Processes and Basic Stochastic Analysis 



Finally, 



1 „ 1 



etr(--e + -(/ p + 2Sf/)- 1 0) 

= etr(-^e(/ p -(/ p + 2Sf/)- 1 )) 

= ebr(~Q(I p + 2EU)- 1 (I p + 2EU- I p ) 
= etr(-e(/ p + 2E£/)- 1 E£7) 

and altogether 

E{etr-US) = [ etr{-US)f s {S) dS = (2* np T p (^) det(£)*) _1 etr(- ^G) 
Js+ 2 2 

x I eti(-US- -Z- 1 S)det(S)^ n ~ p ~ 1) F 1 (-;-ei:- 1 S)dS 
Js+ 2 2 4 

= det(/ p + 2EC/)-?etr[-e(/ p + 2E[/)- 1 Ef/] 

□ 

Theorem 3.19. (Characteristic Function of the Noncentral Wishart Distribution, cf. 



Gupta and Nagar\ fMUty Th eorem 3.5.3.]) 



Let S ~ W p (n, E, 0). JTien #ie characteristic function of S is given by 

E(etr(iZS)) = det(I p - 2iEZ)-?etr[iG(I p - 2iEZ)~ x EZ] (3.6) 
wt/i Z e M P (R) 

The next Lemma shows that the noncentral Wishart distribution is the square of a 
matrix variate normal distributed random matrix. Hence, it is the matrix variate extension 
of the noncentral chi-square distribution. 

Lemma 3.20. (Cf. \Gupta and Nagar\ JMUty Theorem 3. 5.1. J) 

Let X ~ J\f Pjn (M, E <g> I n ), n e {p,p + 1, . . .}. Then XX T ~ W p (n, E, E^MM 7 ). 

Consider S ~ W p (n, E, B). With the foregoing Lemma we can interpret the parameter 
E as a scale and the parameter as a location parameter for S. Especially, a central 
Wishart distributed matrix may be thought of a matrix square of normally distributed 
matrices with zero mean. 



3.2. Processes and Basic Stochastic Analysis 

First, we make a general definition. 

Definition 3.21 (Matrix Variate Stochastic Process). 

A measurable function X : IR + x O — > A^ mjTl (R), (t, uS) \- > X(t,w) = X t (u) is called a 
(matrix variate) stochastic process ifX(t,u) is a random matrix for allt G M+. 
Moreover, X is called a stochastic process in 5+ if X : K + x O — > S+. 
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3. Matrix Variate Stochastics 



With IR + we always mean the interval of all non-negative real numbers, i.e [0, oo). 
We can transfer most concepts from real stochastics to matrix variate stochastics, if we 
say a matrix variate stochastic process X has property P if each component of X has 
property P. We explicitly write this down for only a few cases: 

Definition 3.22 (Local Martingale). A matrix variate stochastic process X is called a 
local martingale, if each component ofX is a local martingale, i.e. if there exists a sequence 
of strictly monotonic increasing stopping times (T n ) nf z^,T n —i oo, such that X m ^ n ^ n },ij 
is a martingale for all i,j. 

Definition 3.23 (Matrix Variate Brownian Motion). 

A matrix variate Brownian motion B in Ai niP (M.) is a matrix consisting of independent, 
one- dimensional Brownian motions, i.e. B = (Bij)ij where B^ are independent one- 
dimensional Brownian motions, 1 < % < n, 1 < j < p. We write B ~ BAi n # (and 
B ~ BM n if p = n). 



Remark 3.24. B t ~ J\f ntP (0,tI, 



up j 



Theorem 3.25. Let W ~ BM n , p , A e M m ^(R), B £ M m .n(M) and C G M p>q (R). Then 
A + BW t C ^M m , q {A,t{BB T )®{C T C)). 



Proof. Follows from Theorem 3.15 and the fact that I np — I n ® I p □ 



Definition 3.26 (Semimartingale). A matrix variate stochastic process X is called a 
semimartingale if X can be decomposed into X = Xq+M+A where M is a local martingale 
and A an adapted process of finite variation. 

We will only consider continuous semimartingales in this thesis. 

For a n x p-dimensional Brownian motion W ~ BM. n ^ pi stochastic processes X resp. Y 
in Ai mtn {R) or resp. A4 Ptq (R) and a stopping time T the matrix variate stochastic integral 
on [0, T] is meant to be a matrix with entries 

/ i>T \ n p „T 

/ X t dW t Y t ) =J2J2 X t,kY m dW tM Vl<z<m,l<j<g 
\ Jo ' iJ k=l l=i Jo 

Theorem 3.27 (Matrix Variate Ito Formula on Open Subsets). Let U C A^ mjn (lR) be 
open, X be a continuous semimartingale with values in U and let f : U — > 1R be a twice 
continuously differentiable function. Then f(X) is a continuous semimartingale and 



f(X t ) = f(X ) +tr U Df(X s fdX s j + ^ I E E dXi , dx J (X s )d[X l3 ,X klls 

(3.7) 



mth D = (dkl)iJ 



Proof. Follows from Revuz and Yor 2001 Theorem 3.3 and Remark 2, Chapter IV] and 



Lemma [2TT1 □ 
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3.3. Stochastic Differential Equations 



Corollary 3.28. Let X be a continuous semimartingale on a stochastic interval [0, T) 
with T = inf{t : X t U} for an open set U C M mjn (M.), X G U, and let f : U ->■ E 6e a 
twice continuously differentiable function. 

Then T > 0, (/(^))te[o,T) is a continuous semimartingale and (3.1) holds for t G [0, T). 

Proof. For any subset A C A / f p (IR) , define the distance between A and any point x G 
A / fp(IR) by d(x,A) := inf zg A c?(x, z). Clearly, it exists an IV = iV(u;) G N such that 
d(X (cj), dU) > 4. The continuity of X and X G Z7 a.s imply Af < oo a.s., and as X is 
adapted, (T n ) n >jy with 

T n := inf{t > : d(X t , dU) < -} 

n 

defines a sequence of positive stopping times. For every n > N, the stopped process 



X n := X tA T n is a continuous semimartingale with values in U, thus Theorem 3.27 



can 



be applied to see that f(X n ) is a continuous semimartingale and (3.7) holds for X n . 
Because T n converges to T = inf{t > : X t G dU}, we can conclude T > and that 
(f(X)) te [Q ) T) is a continuous semimartingale and (3.7) holds for (X) te [ 0jT ). □ 

With the Definition of a 'Matrix Quadratic Covariation' we are able to state the matrix 
variate partial integration formula in a handy way. 

Definition 3.29 (Matrix Quadratic Covariation). For two semimartingales A G Aid,m{^-), B G 
A4 mi n(IR) the matrix variate quadratic covariation is defined by 

m 

[A B]tjj = J2i A ik,B kj ] t G M d , n (R) 

k=l 



Theorem 3.30 (Matrix Variate Partial Integration). (Cf. Barndorff- Niels en and Stelzer 
{ 2007\ Lemma 5.11.]) Let A G Aid, m (M), B G Ai min {R) be two semimartingales. Then 
the matrix product A t B t G AidniM) is a semimartingale and 



A t B t = [ A t _dB t + [ dA t B t - + [A,B] 
Jo Jo 



t 



where A t _ denotes the limit from the left of A t 



3.3. Stochastic Differential Equations 

When we talk about (matrix variate) stochastic differential equations, we can classify 
between two different kind of solutions: Weak and strong solutions. Intuitively, a strong 
solution is constructed from a given Brownian motion and hence a 'function' of that 
Brownian motion. 



Definition 3.31. (Weak and Strong Solutions, cf. Revuz and Yor 2001, Chapter IX, 
Definition 1.5]) Let (fl,G, (Gt)tm+,Q) be a filtered probability space satisfying the usual 
conditions and consider the stochastic differential equation 

dX t = b(t, X t ) dt + cr(t, X t ) dW t , X = x (3.8) 

where b : E + x M mjn (R) — > A"f m , n (lR) and o : R + x A / f m>n (IR) — > M m ^(R) are measurable 
functions, x G Ai m ^ n {R) and W is a p x n- dimensional Brownian motion. 
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3. Matrix Variate Stochastics 



A pair (X, W) of Tt- adapted continuous processes defined on (Cl,G,(Gt)teR+,Q) ^ s 
called 

a solution of (3.8) on [0, T), T > 0, if W is a (Gt)tes. + -Brownian motion and 

X t = x + [ b(s,X s )ds + [ a(s,X s )dW s Vte[0,T) 
Jo Jo 



Moreover, the pair (X, W) is said to be a strong solution of (3.8), if X is adapted to 
the filtration (Gt V )tm+> where = a c (W s ,s < t) is the a -algebra generated from 
W s ,s < t, that is completed with all Q-null sets from Q . 



• A solution (X,W) which is not strong will be termed a weak solution of (3.8). 

To define the probability law P x of a stochastic process X : IR + x Q — > M. m)n (M.), we 
consider, according to 0ksendal 2000 , X as a random matrix on the functional space 
(■Mm,n(M) R+ , J 7 ) with a- algebra T that is generated by the cylinder sets 

{u e n : X tl (u) eF u .. .,X tk {u) G F k } 

where Ft C A4 mjn (R) are Borel sets and k G N. 

X : (CI, J 7 , P) — > (A^ m ,„(M) K+ , J 7 ) is then a measurable function with law P x . 



Definition 3.32. (Uniqueness, Revuz and Yor (2001, Chapter IX, Definition 1.3]) Con- 
sider again the stochastic differential equation (3.8). 



It is said that pathwise uniqueness holds for (3.8), if for every two solutions (X, W) 
and (X\ W) defined on the same filtered probability space, X = X' and W = W 
a.s. implies that X and X' are indistinguishable, i.e. for P -almost all u> it holds 
that X t (oj) = X' t (oj) for every t, or equivalently 



P I sup \X t -X' t \ > 

te[o,oo) 



There is uniqueness in law for (3.8), if whenever (X,W) and (X',W) are two so- 



lutions with possibly different Brownian motions W and W (in particular if (X, W) 

and (X', W) are defined on two different filtered probability spaces) and Xq = X' Q , 
then the laws P x and P x are equal. In other words, X and X' are two versions of 



the same process, i.e. they have the same finite dimensional distributions (see Revuz 



and Yor (2001, Chapter I, Definition 1.6]). 



Remark 3.33. 



(i) As Yamada and Watanabe ,1571a, Proposition 1] have shown, pathwise uniqueness 
implies uniqueness in law, which is not true conversely. 



(ii) If pathwise uniqueness holds for (3.8), then every solution of (3.8) is strong (see 
Revuz and Yor 1 2001 , Theorem 1.7]). 
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3.3. Stochastic Differential Equations 



(Hi) The definition of pathwise uniqueness implies that there exists at most one strong 



solution for (3.8) up to indistinguishability . 



(iv) According to Skorohod \1965 , p. 59f], the stochastic differential equation (3.8) al- 
ways has a weak (but not necessarily unique) solution if b and a are continuous 



functions. If in this situation pathwise uniqueness holds for (3.8), then there exists 
a unique strong solution up to indistinguishability. 



Similar to the theory of ordinary differential equations, the function on the right hand 
side of a stochastic differential equation being locally Lipschitz is sufficient in order to 
guarantee (strong) existence on a nonempty (stochastic) interval and (pathwise) unique- 



ness. 



Theorem 3.34. (Existence of Solutions of SDEs driven by a continuous Semimartingale, 
cf. Stelzer (2001, Theorem 6.7.3.]) Let U be an open subset of Aid,n(^) an d (U n )nen a 
sequence of convex closed sets such that U n C U, U n C U n+ (i n G N and [j neN U n = 



U. Assume that f : U — >■ Aid, m (^) is a locally Lipschitz function and Z in Ai m 
is a continuous semimartingale. Then for each U -valued J r Q-measurable initial value X 
there exist a stopping time T and a unique [/-valued strong solution X to the stochastic 
differential equation 

dX t = f(X t )dZ t (3.9) 

up to the time T > a.s., i.e. on the stochastic interval [0, T). On T < oo we have that ei- 
ther X hits the boundary dU ofU atT, i.e. Xt G dU or explodes, i.e. Hmsup t _ ) . Tt<T \ \X t \ \ = 
oo. If f satifies the linear growth condition 

||/(X)|| 2 <K(1 + ||X|| 2 ) 

with some constant K G lR +; then no explosion can occur. 

Remark 3.35. With the term unique it is meant that there holds pathwise uniqueness for 



(3.9). In other words, every two solutions (X, Z) and (X', Z) of (3.9) defined on the same 
probability space and with the same continuous semimartingale Z and the same initial 
value are indistinguishable. 



Definition 3.36. (Local Lipschitz, cf. Stelzer (2007, Definition 6.7.1. J) Let (U,\\.\\ u ) l 
(V, \ \.\\v) be two normed spaces and W C U be open. Then a function f : W — )■ V is 
called locally Lipschitz, if for every x G W there exists an open neighbourhood U \x) C W 
and a constant C(x) G 1R + such that 



\\f{z) - f(y)\\v < C(x)\\z - y\\u Wz,ye U(x) 

C(x) is said to be a local Lipschitz coefficient. If there is a K G 
can be chosen for all x G W , f is called globally Lipschitz. 



(3.10) 

such that C(x) = K 



If the process Z from Theorem 3.34 is a continous Levy process (Brownian motion 



with drift), it can be shown that the solution X of (3.9) is a Markov process. Before we 
state this in a theorem, we give a general definition of the term 'Markov process' and a 
necessary technical condition on our probability space. 
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3. Matrix Variate Stochastics 



Definition 3.37. (Markov Process, cf. \Stelzer\ J200% Definition 6.7.7.]) Let U C M P (R) 
be open and Z be a process with values in U which is adapted to a filtration (J r t)t&R + - 

(i) Z is called a Markov process with respect to (J z t)tm.+ , if 

E(g(Z u )\F t ) = E(g(Z u )\Z t ) 
for all t G M + , u > t and g : U — >■ M bounded and Borel measurable. 

(ii) Let Z be a Markov process and define for all s,t G IR + , s < t the transition functions 
P s j{Z s ,g) := E(g(Z t )\Z s ) with g :[/—>■ R bounded and Borel measurable. If 

P Si t = P ,t-s =■ Pt-s for all s,teR + ,s<t 

then Z is said to be a time homogeneous Markov process. 

(Hi) A time homogeneous Markov process is called a strong Markov process, if 

E(g(Z T+s )\F T ) = P s (Z T ,g) = E(g(Z T+s )\Z T ) 

for all g :U — > M bounded and Borel measurable and a.s. finite stopping times T . 

At the beginning of this chapter we assumed J 7 , (J-i)t>o, -P) to be a filtered proba- 
bility space where (J"t)t>o is right continuous and all Tt are complete. Now, we need to 
enlarge our given probability space in order to get arbitrary initial values. 

Definition 3.38. (Enlargement of a probability space, cf. \Protter\ JMU^ ) 

Let (fi, J 7 , (J r t ) t >o, -P) be a filtered probability space where (J r t)t>o is right continuous, i.e. 

fls>t^ r 5 = Ft f or every t > 0, and all T t are completed with sets from T having P- 

probability zero and U C Ai p (M) be an open subset of Ai p (M). 

The probability space (Q, J 7 , (J r j)t>o, {P V )y&u) with 

U = UxVl, F t = f)(j(B(U)xF u ), ~T = a(B{U) x J 7 ), P y = 5 y x P 

u>t 

is called the Enlargement of(fl,J r , (J : i) i >o,P). The measure S y is the Dirac measure w.r.t. 
y G U, i.e. for A C U it holds that S y (A) = 1 if and only if y G A. A random matrix 
Z : Q — )• Aip(R) is extended to Q by setting Z((y,u)) = Z(u) for all (y,co) G Q. 

Eventually we are able to state 

Theorem 3.39. (Markov Property of Solutions of SDEs driven by a Brownian Motion, 
cf. Stelzer ^2007, Theorem 6.7.8.]) Recall the Assumptions of Theorem 3.34 (without the 
linear growth condition) , but let this time Z be a Brownian motion with drift in Ai mjJl (M.), 
i.e. Z ~ BM. m ^ n . Suppose further that T = oo for every initial value xq G U . Consider 
the enlarged probability space (Q,J r ,(J r t)t>o>{P V )yeu) an d define X by X ((y,u)) := y 
for all y G U . Then the unique strong solution X of 

dX t = f(X t )dZ t (3.11) 

is a time homogeneous strong Markov process on U under every probability measure of the 
family (P y ) y&U - 
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3.3. Stochastic Differential Equations 



The Girsanov Theorem is a powerful tool that gives us the opportunity to construct a 
new probability measure Q such that a drift changed P-Brownian motion (that is not a 
Brownian motion under the probability measure P anymore) is a Brownian motion under 
the new measure Q. Before we state the Girsanov Theorem we make 

Definition 3.40 (Stochastic Exponential). Let X be a stochastic process. The unique 
strong solution Z = £(X) of 



dZ t = Z t dX t , 
is called stochastic exponential of X. 



(3.12) 



From Theorem 3.34 we get immediately that (3.12) has a unique strong solution. 

Theorem 3.41 (Matrix Variate Girsanov Theorem). Let T > 0, B ~ BhA v and U be an 

adapted, continuous stochastic process with values in Ai p {R) such that 



£ tr 



UJ dB< 



(3.13) 



te[o,T] 



is a martingale, or, what is sufficient for (3.13), but not necessary, such that the Novikov 
condition is satisfied 

1 



E etr 



UjU t dt 



< oo 



Then 



Q 



£ tr 



dB t dP 



is an equivalent probability measure, and 



B, 



U s ds + B t 



(3.14) 
(3.15) 

(3.16) 



is a Q-Brownian motion on [0, T). 



Proof. We use the multivariate Girsanov Theore m of Kallenberg 



The process V := vec(— U) G MP is according to Revuz and Yor 



9971 Corollary 16.25]. 
20011 Proposition 4.8, 



Chapter I] progressively measurable and B v := vec(B) is a Brownian motion on M. p . The 
Novikov condition is then 



E I exp 




T P 



E v i dt 



E exp 



E etr 



tr(U?U t ) dt 



UjU t dt 



< oo 



With Kallenberg 1997 Corollary 16.25] the new measure Q is then 



Q 



£ 



V t L dB v t ) dP 



£ tr 



T 



Ut 1 dB i 



dP 
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and 



B\ =B v t - / V s ds 
Jo 

is a Q-Brownian motion for t G [0, T] and with values in W 2 . Hence, 

B t = vec- 1 (B?)= [ U s ds + B t 
Jo 

is a Q-Brownian motion for t G [0, T\ and with values in Ai p (M). □ 

3.4. McKean's Argument 

Levy's Theorem gives us an easy way to decide whether a continuous local martingale is 
a Brownian motion. 

Theorem 3.42 (Levy's Theorem). 

Let B be a p x p dimensional continuous local martingale such that 



[Bij, B, 



kl\t 



t, if i = k and j = I 
0, else 



for all i,j, k G {1, . . . , p} and B = 0. Then B is a p x p dimensional Brownian motion, 
B ~ BM P . 

Proof. It suffices to show that X := vec(S) is a p 2 -dimensional Brownian motion. 
Let a, b G {1, . . . ,p 2 }- Then, there exist i, j, k,l G {1, . . . ,p} such that a = p{j — 1) + i 
and b = p{l — 1) + k. Clearly, i = k and j = I imply a = b, and conversely, if we assume 
a = b, then i ^ k or resp. j ^ I imply contradictions. Thus, 

[X a ,X b ] = [Bij,B k i] = tl{ i=k jl{ j= ij = tl{ a= 6} 



With Protter 2004, Theorem 40, Chapter 2] we can conclude that X is a p 2 -dimensional 



Brownian motion. □ 

For every finite stopping time r we say M is a local martingale on the (stochastic) 
interval [0,r ] if the stopped process (M in f{ t ro }) ieK+ is a local martingale. The next The- 
orem shows us that every stopped continous local martingale is a stopped, time-changed 



Brownian motion. There also exists a version of Theorem 3.43 without stopping times 



but with the additional assumption that \im.t-+oo[M, M]t = oo, see the Dambis-Dubins- 



Schwarz-Theorem in Revuz and Yor 2001| Theorem 1.6, Chapter V]. 



Theorem 3.43. Let < To < oo be a stopping time and M be a continuous local mar- 
tingale on the interval [0,Tq], which is not identically equal to zero. If we set 

T t := inf{s : [M,M] S > t} (with the convention inf{0} = oo) 
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3.4. McKean's Argument 



then B t := My t is a stopped J 7 ^- Brownian motion on the stochastic interval [0, [M, M] ro ) 7 
[M,M] T0 > a.s., i.e B is a Brownian motion w.r.t. the a-algebra 



T Tt = {Ae T : An{T t <t} e T % te R+} 



that is stopped at [M,M] T0 . Furthermore, it holds that M t = B<M,M] t > i- e - M is a stopped, 
time-changed Brownian motion. 



Proof. Observe that T t is a generalized inverse function of [M, M] t . Because M is contin- 
uous, and hence [M, M], too, it holds that [M, M]t 4 = t, but still we only have T[M,M]t > t 
in general (and strict equality for every t if and only if M was strict monotonic on the 
entire interval). See Revuz and Yor |2001 p. 7-8] for details. 

Suppose [M,M] T0 = 0. Because [M, M] is an increasing process, [M,M] t = for all 
* G [0,t ], thus E{[M,M] t ) = for all t E [0,r ]. With |Protter] [20011 Chapter II, Corol- 
lary 3] it follows E(Mf) = for all t G [0, 7"o], hence M is equal to zero, which is a 
contradiction. 

The set {T t }teR + is a family of stopping times, because M is adapted. Observe that 



[M, M] ro >t^T t <r 



Hence T t < oo as tq < oo. From Revuz and Yor 2001, Proposition 4.8 and 4.9, Chapter 



I] we know that the stopped process M Tt is J-ri- measura t ) le. Furthermore, 



[B,B} t =[M,M} Tt =t 



as mentioned above. With Theorem |3.42 we can can conclude that B is a stopped 
J^-Brownian motion on [0, [M, M] T0 ). 

To prove that M is a time-changed Brownian motion, observe that T[M,M] t > t if and only 
if [M, M] is constant on [t, T[M,M] t ]- As M and [M, M] are constant on the same intervals, 
we have M T = M t and thus B [MM]t = M t . □ 



With the foregoing Theorem we can form an argument that will allow us in turn to 
proof some result on the existence of the Wishart process in the next chapter. 



Theorem 3.44. (McKean's Argument, cf. McKean 1969, p. 47, Problem 7J) 

Let r be a real-valued continous stochastic process such that P(r > 0) = 1 and h : IR + — > 

such that 



• h(r) is a continuous local martingale on the interval [0, r ) for r := inf {s : r s = 0} 

• ]xm x _+ 0>x>0 h(x) = oo or resp. lim a .^. ,x>o K x ) = _0 ° 

Then tq = oo almost surely, i.e. r t (u) > V£ G R+ for almost all oj. 

Remark 3.45. If h(r t ) = J * f(s,r s ) dW s with a Brownian motion W , then for h{r) being 
a continuous local martingale on [0, To) it is sufficient that f is square integrable on [0, To). 
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Proof. Suppose r < oo. 

Define M := h(r) and T t as in Theorem 3.43 M is a continuous local martingale on [0, To] 
and with Theorem 3.43 we can conclude that B t := Mx t is a stopped Brownian motion 
on [0, [M, M] T0 ). Observe, that r > and the continuity of r impliy r > 0. 
Consider the case lim a ,^ ,x>o h(x) = oo. On the interval [0, To), the process h(r t ) takes 
every value in [h(ro), oo), especially [h(r), h(r)] TQ = [M, M] To > 0. For r < oo we have 

t ->• [M, M] TQ => T t -> r r Tt -> ^ = /i(r r J -> oo 

There are two cases: 

• [M, M] TQ < oo, what is impossible because a Brownian motion can not go to infinity 
in finite time, see Revuz and Yor 2001, Law of the iterated logarithm, Corollary 
1.12, Chapter II]. 



• [M, M] TQ = oo, which implies that B is a Brownian motion on 
oo almost surely and that is a contradiction because, 

P{B t < infinitely often for t — > oo) = 1, 

i.e B has infinite oscillations. 

As both cases imply contradictions, we conclude t = oo. 
The case for lim^o^x) h(x) = — oo is analogous. 



that converges to 



□ 



3.5. Ornstein-Uhlenbeck Processes 



We give the following definition according to Bru 1991 such that we are later able to 
show that some solutions of the Wishart SDE can be constructed out of a matrix variate 
Ornstein-Uhlenbeck process. 

Definition 3.46 (Matrix Variate Ornstein-Uhlenbeck Process). 

Let A,Be M P (M), x e M n , p (M.) a.s. and W ~ BM n , P - A solution X of 

dX t = X t B dt + dW t A, X = x (3.17) 

is called n x p- dimensional Ornstein-Uhlenbeck process. We write X ~ OUV n , p (A, B, xq). 

AsX4 XB and X h> A are trivially globally Lipschitz and satisfy the linear growth 
condition we know from Theorem 3.34 that (3.17) has a unique strong solution on the 
entire interval [0, oo). Luckily, we are even able to give an explicit formula for the solution 
of j3l7| >. 

Theorem 3.47 (Existence and Uniqueness of the Ornstein-Uhlenbeck Process). 
For a Brownian motion W ~ BM. n v , the unique strong solution of (3.11) is given by 



X t 



x e 



Bt 



dW, Ae~ Bs e Bt 



(3.18) 
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3.5. Ornstein-Uhlenbeck Processes 



Proof. 

dX t = d(x e Bt ) + d 



o 



c e Bt Bdi ; dW, A(~ u 'i L " -\ ( J dW s Ae~ as ) < L " B dt ■■■ ■■ d 



dW q Ae~ Bs » 



-BtBt 



Bs \ Rt 





M 






3- Bs ,e B - 






t 

> 



+ \J A )Bdt + dW t A 

X t B dt + dW t A 



Furthermore, (3.18) is a strong solution by construction. 



□ 



The following Lemma will be useful for determining the conditional distribution of the 
Ornstein-Uhlenbeck process. 

Lemma 3.48. Let W ~ BM. n ^ p and X : IR + — > M. p , m (M>), t h- > X t be a square integrable, 
deterministic function. Then 

-t rt 



[ dW s X s ~Af ntTn (0,I n ® [ XjX s ds) 
Jo Jo 



Proof. H t := J^dW s X s . H t E Ain^M.) is a £ 2 -limit of normal distributed, independent, 
random variables and therefore normally distributed (Cf. 0ksendal 2000, Proof of Theo- 
rem 5.2.1 and Theorem A. 7.]). Furthermore, H t is a martingale with a.s. initial value zero 
and thus expectation zero. 

cov(if t , H t ) is a block diagonal matrix, because different rows of W are independent. 
Hence, cov(H t , H t ) is a tensor product of /„ with another m-dimensional matrix. Further- 
more, all rows of W are identically distributed such that we only need to consider row 1 
of H t , i.e. the first block matrix D\ of cov(H t , H t ). We have that 



P rt 

Ht,u = / dW Sy \k X St ki 



and thus 



D t,ij = coy(H tjU , H tAj ) 



P pt rt 

cov( / dW sAkt X sMi , dW^k,X x 
Jo Jo 



E_ 

p 

E 



's,lkj ^S^jj, 



Xs^iiXg^jj dsl{ki=k 3 } 



P rt rt 

/ X s> kiX S) kj ds = (XjX s )ij ds 
fe =i Jo Jo 



where we used the fact that cov( f fx(s) dW s>lki , j f 2 {s) dW s>lkj ) = f fi(s)f 2 (s) ds l{fe j =fc i } 
for deterministic, square integrable real functions f%, f 2 - □ 
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3. Matrix Variate Stochastics 



Now we are able to show that the (matrix variate) Ornstein-Uhlenbeck process is (ma- 
trix variate) normally distributed. Later we will see that some Wishart processes are 
therefore 'square' normally distributed, that is by Lemma 3.20 a noncentral Wishart dis- 
tribution. 

To clarify terms, with distribution of a stochastic process X we always mean from now on 
the distribution of X t at time t conditional on the initial value Xq. Although it may be 
confusing, the terms distribution of a stochastic process and law of a stochastic process 
have a different meaning in this thesis. 

Theorem 3.49 (Distribution of the Matrix Variate Ornstein-Uhlenbeck Process). 

Let X ~ OWPn tP (A, B,x ) with a matrix B e M P (M) that satisfies ^ -cr(B) - a(B). 

Then the distribution of X is given by 

X t \x ~ K, P (x e Bt , I n ® [A-\A T A) - A-\e BTt A T Ae Bt )\) 

where A^ 1 is the inverse of the linear operator A: S p — )■ S p , X h-> —B t X — XB. 

shows us that 



Proof. Define H t = J Q dW s Ae Bs . Lemma 



3.48 



H t ~JV niP (0,I n ® f e~ BTs A T Ae 
Jo 



ds) 



Define the operator A : X \-> -B T X - XB as in Lemma 2.12 Then we have 

d 



ds 



A r Ae- Bs )=A(e- Bs A r Ae- Bs ) 



and hence 



e -B T s A T Ae -Bs ds 



A- 1 {e~ BTs A T Ae- Bs )\ t s 



s=0 



(3.19) 



gives us 



The equality X t = x§e Bt + H t e Bt and Theorem 3.15 

X t \x ~ N n , p (x e Bt , I n ® e BTt A- 1 (e- BTs A T Ae- LS ^ 



s=0 c 



Because A is a linear operator, i is also linear and we can simplify 



A- x (e- BTs A T Ae- Bs )\\ 



s=0 



A 



-B t A T 



A T Ae- Bt )-A-\A T A) 



and, because A 1 is the integral from (3.19), 

Bt ) - A-\A T A))e Bt = A~\A T A) - A~\e BTt A T Ae 



i BTt {A-\e~ BTt A T Ae 



Bt\ 



□ 



Theorem 3.50 (Stationary Distribution of the Ornstein-Uhlenbeck Process). 

Let X ~ OUV n , p (A, B, Xq) with a matrix B e A4 P {R) such that all eigenvalues of B have 

a negative real part, i.e. Ke(a(B)) C (— oo,0). 

Then the Ornstein- Uhlenbeck process X has a stationary limiting distribution, that is 

Af n , P (0,I n ®A-\A T A)) 
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3.5. Ornstein-Uhlenbeck Processes 



Proof. Observe that Re(cr(£>)) C (— oo,0) implies ^ —a(B) — cr{B) and we can use 
the foregoing Theorem. With Theorem |3.14| we get for t G (0, oo) that X t given xq has 
characteristic function 



pxt(z) = etr ( iZ T x e Bt - -Z T Z^ t 



with 



% = A- 1 (A T A) -A' 



A Ae 



(3.20) 
(3.21) 



Now, we write the matrix B in its Jordan normal form: 
There exists a matrix T G GL(p) such that 



B = T 



( Ji \ 

'•• 
V o J m ) 



T 



-i 



with block matrices, called Jordan blocks, J{ G Aii^M.) such that 1% + . . . + 1„ 
the calculation rules for the matrix exponential we get 



exp(Bt) = Texp 



/ / J x t \ \ 

'•• 
\\0 J m t) ) 



T = Tdiag(e 



Jit 



p. With 



(3.22) 



Without loss of generality, consider only the first Jordan block 

/A 1 ... 0\ 



J := Ji 



A 



eMi 



\0 A/ 



with I :— li, A = a + 6i an eigenvalue of 5 with a < 0. 

We can separate J into a sum of two matrices, J = D + N, with a diagonal matrix 
D = diag(A, . . . , A) G A^;(M) and a matrix 



/0 1 
1 



N 



... 0\ 

'•• 

'•. 1 

0/ 



G 



\o 

Because D and commute, DN = ND, we have 

exp( Jt) = exp(Dt + Nt) = exp(Dt) exp(Nt) 
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3. Matrix Variate Stochastics 



and thus 



exp(Jt) = diag(e A ', . . . , e xt ) exp(Nt) 



e xt exp(Nt) = e M e at exp(Nt) 



The matrix N is nilpotent, as iV' = 0. Hence, exp(Nt) is the finite sum 



exp(Nt) 



1-1 1 

k=0 



N 



n 



Vo 







(i-1) 



eM, 



J 



For a < and M oo we know that e at pt converges to zero for every polynomial p t in t. 
Hence, lim^oo e at exp(Nt) = 0. As e M always has absolute value one, we also have that 
lim^oo exp( Jt) = 0. With (3.22) it is now obvious that 

lim exp(Bt) = 

t— >oo 

The next step is to observe that S p is a finite dimensional and normed space with the 
norm induced by the inner product trace. Hence, the linear operator A^ 1 : S p — >■ S p is 
continuous and with the above can conclude that ty t from (3.21) converges pointwise to 



lim iff t 

t— >oo 



A-\A T A) 



and thus 



t— >oo 



1 



lim P x t(Z) = etr ( - -Z T Z A' 1 (A T A) 



f(Z) 



(3.23) 



Clearly, f is continuous at Z = 0. With Levy's Continuity Theorem we know that there 

exists a probability measure /x on M P {R) such that f(Z) = fi(Z), and P Xt W -T //. 
The function / is the characteristic function of a normal distributed random matrix. 
That implies by Theorem 3.14 that X t converges in distribution to a random matrix with 
normal distribution /i, 

li = N n ^,I n ®A-\A T A)) (3.24) 

and the limit is independent of the initial value xo- Thus, and by the fact that X is a 
Markov process according to Theorem 3.39, the limit distribution in (3.24) is a stationary 
distribution. □ 

Corollary 3.51. Let X ~ OUV n . v {A, B,x ) with a matrix B e M P (R) such that B e S~ 
and A T A and B commute. Then the stationary distribution of X is given by 



AC 



n,p 



0,/n 



-A AB~ 



Proof. 
A~\A r A) 



-A T AB- 1 & A T A = A(—-A T AB 



(BA' AB' 1 + A A) = A r A 



□ 
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4. Wishart Processes 



In this chapter the main work on the theory of Wishart processes will be done. As an 
introduction we begin with the one dimensional case, called the square Bessel process. 
Afterwards, we focus on the general Wishart process and give a theorem about the ex- 
istence and uniqueness of Wishart processes at the end of section 4.2. In section 4.3, we 
show that some Wishart processes can be expressed as matrix squares of the matrix vari- 
ate Ornstein-Uhlenbeck process from section 3.5. Eventually, in section 4.4 we give an 
algorithm to simulate Wishart processes by using an Euler approximation. 



4.1. The one dimensional case: 
Square Bessel Processes 

The square Bessel process is the one-dimensional case of the Wishart process - in fact, 
even a special case of the one-dimensional Wishart process. We consider it separately, 
because it is much more elementary than for higher dimensions. 

Definition 4.1 (Square Bessel Process). Let a > 0, f3 be a one- dimensional Brownian 
motion and xq > 0. A strong solution of 

dX t = 2y/X~ t d/3 t + otdt, X = x (4.1) 

is called a square Bessel process with parameter a and denoted by X ~ BESQ(«,Xo). 

Remark 4.2. Let B denote a Brownian motion on W 1 (that is an W 1 -vector of n 
independent one- dimensional Brownian motions). The Bessel process is then the Euclidean 
norm of B, that is V B T B. Hence, the process X with X t = Bj B t is often called square 
Bessel process in the literature. Using Theorem \4-9\ and Theorem 3.30 one can show that 



X ~ BESQ(n, xq). Hence, using (4-1) for the definition of the square Bessel process is 



more general as it allows n = a to be any real, non-negative number. 

Before we give a comprehensive theorem about existence, uniqueness and non-negativity 
of the square Bessel process, we state the following results for one-dimensional stochastic 
differential equations: 



Theorem 4.3. (Pathwise Uniqueness, cf. Yamada and Watanabe '1971a, Theorem lj) 
Let 

dX t = a{X t ) dfa + b(X t ) dt (4.2) 
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be a one-dimensional stochastic differential equation where b, a : R — > R are continu- 
ous functions and (3 a one- dimensional Brownian motion. Suppose there exist increasing 
functions p, k : (0, <x>) —> (0, oo) such that 



- *{v)\ < tilt ~ v\) V ^ 
-l 



with 



p (u) du 



oo 



(4.3) 



o 



and 



with 



-l, 



■u) du 



oo 



(4.4) 



TTien pathwise uniqueness holds for (4-2). 



Theorem 4.4. (Comparison Theorem, Revuz and Yor 2001, Chapter IX, Theorem 3.1]) 
Consider two stochastic differential equations 

dX\ = a{X]) dfo + b\Xl) dt, dX 2 = a{X 2 ) d(3 t + b 2 {Xf) dt 

that both fulfill pathwise uniqueness and let b l ,b 2 be two bounded Borel functions such that 
b 1 > b 2 everywhere and one of them is globally Lipschitz. If (X x ,/3) is a solution of the 
first, and (X 2 ,(3) a solution of the second stochastic differential equation (X 1 ,X 2 defined 
on the same probability space), w.r.t the same Brownian motion f3 and if Xq > Xq a.s., 
then 

P[X] > X 2 for all t 6 R + ] = 1 (4.5) 

Theorem 4.5 (Existence, Uniqueness and Non-Negativity of the Square Bessel Process). 
For a > and Xq > there exists an unique strong and non-negative solution X of 



X t = x + 2 



X s d(3 s + at 



(4.6) 



on the entire interval [0, oo). 

Moreover, if a > 2 and Xq > the solution X is positive a.s. on [0, oo) 



Proof. We show with Theorem 4.3 that pathwise uniqueness holds for the stochastic 
differential equation 



dX t = 2J\X t \ dfa + « dt, X 



Xq 



(4.7) 



like in 



Revuz and Yor 



2001 



have that 



p. 439]. Since \y/z — \fz'\ < yj\z — z'\ for all z,z' > 0, we 



<2vW 



with p{u) = 2y/u. Clearly, 



(u) du 



du 



u 



(4.t 
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Definition and Existence Theorems 



With Remark 3.33 we conclude that there exists a unique strong solution for (4.7). 



Next, we show that our solution never becomes negative: First, consider the case x\ = 
and a 1 = 0. Obviously, X 1 = is our unique strong solution in this case. Secondly, 
consider an arbitrary Xq > and a 2 > 0. From the above, we have a unique strong 
solution X 2 . The comparison theorem implies that 



P[X 2 > X\ for all t G 



P[X 2 > for all t G 



and thus X? is non- negative for all t almost surely. Hence, we can discard the |.| in (4.7) 



and X also is the unique strong solution of (4.6) 



Finally, [Revuz and Yor |2001 Proposition 1.5, Chapter XI] has shown that for a > 2, the 
set {0} is polar, i.e. P(inf{s : X s = 0} < oo) = for all initial values xq > 0. Hence, in 
this case the unique strong solution X is positive for all t G K+. □ 



Remark 4.6. For a > 2, one could also use McKean's argument (Theorem 3.44) t° show 



that the square Bessel process is positive, analogously to the Proof of Theorem 4-H 



4.2. General Wishart Processes: 
Definition and Existence Theorems 

Definition 4.7 (Wishart Processes). Let B be a p x p-dimensional Brownian motion, 
B ~ BAi p , Q G M. p (M) and K G A^ P (M) be arbitrary matrices, s G 5+ the initial value 
and a > a non-negative number. Then we call the stochastic differential equation 



dS t 



S t dB t Q + Q v dB v t y/S t + (3 t K + K'St + aQ L Q) dt, S = s 



(4.9) 



the Wishart SDE. 

A strong solution S of (4-9) in 5+ is said to be a (p x p-dimensional) Wishart process 
with parameters Q, K, a, sq, written S ~ WV P (Q, K, a, sq). 



To understand (4.9) intuitively, it may help to have a look at a approximation of the 
form 



rt+h rt+n 

S t+ h &S t + y/S~t dB s Q+ Q T dBjy/S~ t + {S t K + K T S t + aQ T Q)h 
Jt Jt 

Here we can see that Q controls the covariance of our normal distributed fluctuations, 
and that this fluctuations are proportional to the square root of our process: 

/t+h 
dB s Q\S t ~y/s t - Af p , p (0, I p ® Q T Qh) 

Hence, the fluctuations decrease quickly if our process goes to zero. 

As aQ T Q is positive semidefinite, the parameter a determines how large the drift away 

from zero is. 



Just like in the one-dimensional case, the solution of (4.9) has a mean reverting property. 
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To understand that, we consider the deterministic equivalent of (4.9), that is the ordinary 
differential equation 

dS t 
dt 



S t K + K r S t + aQ T Q, S = s 



To simplify the notation, we define the operator C : S p — > S p , X i-> XK + K T X and set 
M = aQ T Q. Then 

d ^ = CS t + M, S = s (4.10) 



From the theory of ordinary linear differential equations (see Timmann 2005, p. 83], for 



example), we know that the solution of (4.10) is given by 



S t = e ct ( .s„ 

Indeed, by partial integration follows 
dS t 



e~ Cu duM 



(4.11) 



dt 



C I e ct I s 



e- Cu duM ) 1 + e ct e- ct M 



ct-ct 



Evaluating the integral in (4.11) gives 



S t = e ct {{s + C- 1 M) -C~ l M 



(4.12) 



If the eigenvalues of K only have negative real parts, Re(a(K)) C (— oo, 0), then, because 
of cr(C) = <y{K) + ct(K), we also have Re(a(C)) C (— oo,0). From the proof of Theorem 
3.50 we get that lim^oo e ct = and thus from (4.12) 



lim S t = -C- X M 

t— >oo 



Hence, the deterministic solution of (4.10) converges to — C l M. Thus, the stochastic 



solution of (4.9) will fluctuate around — C M. 



Considering the fact that the fluctuations decrease quickly if our processes goes to zero 
and that we have a non-negative definite drift aQ T Q away from zero one may think that 
this process never leaves «S+. As we have shown in the last section, for the one dimensional 
case (square Bessel process) we have a unique strong solution for all t that never becomes 
negative. Unlike in this case, we can not show for p > 2 that there still exists a unique 
strong solution after the process hits the boundary of Sp (that means St G 5+\5+) the 
first time. We are only able to give sufficient conditions such that the process stays in the 
set of all positive definite matrices, and then we have a unique strong solution for all t. 



One reason why we cannot transfer the proof of Theorem 4.5 to the matrix variate case is 



that Theorem 4.3 cannot be generalized to matrix variate stochastic differential equations. 
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In the matrix variate version of Theorem 4.3 as stated in Yamada and Watanabe 1971b 



Theoreml], the constraint (4.3) turns to 



but 



for p(U) 



_ p~ 2 (U)UdU 
{ues+-.\\u\\<i} 

U i — V p 2 (U)U~ l is concave 



p- 2 {U)UdU 



oo 



(4.13) 
(4.14) 



I p dU < oo 



{t/eS+:||i7||<l} J{ues+-.\\u\\<i} 
U. Hence the theorem can not be applied to our case. 



Furthermore, Yamada and Watanabe 1971b, Remark 2] have also shown that (4.13) is, 
for p > 3, nearly best possible in the sense that, if f ^+. IIJTM „ p~ 2 (U)U du < oo and p 
is subadditive then, a stochastic differential equation can be constructed that has two 
solutions, and thus pathwise uniqueness cannot hold. But the fact the we cannot use 



Yamada and Watanabe 1971b, Theoreml] does not give any evidence whether pathwise 
uniqueness holds for the Wishart SDE or not. 



Before we can begin our mathematical analysis of (4.9), we need a few auxiliary results: 

Lemma 4.8 (Quadratic Variation of the Wishart Process). Let S ~ WV P (Q, K, a, s ). 
Then 

d[Sij, Ski]t = S tt ik(Q T Q)ji dt + S t: ii(Q T Q)jk dt + S t jk(Q T Q)u dt + S t ji(Q T Q)ik dt 
as long as S exists. 



Proof. Define H t := y/S~ t dB t Q + Q T dB?^/S~ t . Then [S iv S M ] = [Hy, H, 



ki\ 



Ht,ij — ^ ^J y\St)indBt,nmQmj "I" QrnidBt^nm[\/~St) 
m,n 

The first summand of d[Hij, H k i] t is equal to 

^ ^ (V^t) in^y/St) knQmjQ ml dt 



Because St is symmetric this term can be simplified to 

St,ik(Q T Q)ji dt 

The other three summands can be evaluated in the same way. □ 

Lemma 4.9. Let S e S+ be a stochastic process, B ~ BM P and h : M P {M.) — > M P (R). 
Then there exists a one dimensional Brownian motion (3 h such that 



tr (j^h{S u )dB^j = 



tr{h{S u yh{S u ))dft 
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Proof. Define 

h(S-, 



Ln=l J ° 



n,h dB 



^tr(h(S t )?h(S t )) 



t,ni 



Observe that the denominator is zero if and only if the numerator is zero, so we make the 
convention jj := 1. Then we have by definition 



tr(h(S t )dB t ) = KSt) in dBt M = y/tr(h(S t ) T h(S t )) d$ 

i,n=l 

For every i, n — 1, . . . ,p we have 



o \y/ti(h(S) T h(S)) J Jo tr(h(S) T h(S)) 

T V UfQ\2 



< 



„ ^tmsYKS))^ - 1 <0 ° 



and therefore (5 h is a sum of continuous local martingales and thus a continuous local 
martingale itself. Furthermore 

and with Levy's Theorem the proof is complete. □ 
Lemma 4.10. The matrix variate square root function is locally Lipschitz on S£. 



Proof. See |Stelzer| |2007| . □ 

Theorem 4.11 (Existence and Uniqueness of the Wishart Process I). 

For every initial value So G S£ there exists a unique strong solution S of the Wishart SDE 

(4-9) in the cone of all positive definite matrices up to the stopping time 

T = inf{s : det(S s ) = 0} > a.s. 



Proof. We check that the assumptions of Theorem 3.34 are fulfilled. Observe, that the 
set U = S£ is open and that there exists a sequence of convex closed subsets (U n ) ne ^ of 



U that are increasing w.r.t. C and UneN U n = U . Indeed, observe that the function that 
maps every Matrix M G <Sj" to its smallest eigenvalue, 



:£+-► (0,oo),Mh+ A min (M) 



min v T Mv 

\\v\\=l 



is continuously differentiable, see Deuflhard and Hohmann 2002, Lemma 5.1]. Hence, 

/ \ 



U n := {M G S+ : A min (M) > -} = A^ n 



n 



-, oo 



closed 
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is a closed set; and also convex, as for all Mi, M 2 G U n , a G [0, 1]: 



A min («M! + (1 - a)M 2 



= min /(aMi + (1 - a)M 2 )v 
IMI=i 

> a min v T Mit> + (1 — a) min v T M 2 v 

IMI=i IMI=i 

1 , ,11 

> a- + (1 - a)- = - 

n n n 



Next, we define for S G <Sj~ the linear operator by 



and as before 



Z s = Z{S) : M P (R) 4 5 P) l4 VSXQ + Q T X T VS 



C:S P ^S P ,X^XK + K T X 



Then we can write (4.9) in the form 



/ Z S JB U + [ (CS u + aQ T Q)du 
Jo Jo 

Z{S U ) 



CS u + aQ L Q 



d 



ul, 



■=f(Su) 



f{S u )dZ u 



where Metivier and Pellaumail 1980b give a formal justification why we can integrate 
w.r.t Zs u dB u . Obviously, Z is a continuous semimartingale. We still have to show that 
the function / is locally Lipschitz. For any norm given on J\4 P (M), we define a norm on 
M 2p , P (M) by 



\M 



2p,p{ 



\x\ 



Mr 



+ \\Y\ 



Mr 



VITe M, 



but discard the subscripts as it should be obvious which norm to use. Then, for all 
S,R G Sp we have 

\\f(S)-f(R)\\ = \\Z s -Z R \\ + \\C(S-R)\\ 
Because C is a bounded linear operator (dimS p < oo), we have 

\\C(S - R)\\ < \\C 
Let be S, R G <Sj~. For X G J\4 P (M) arbitrary we have 



S-R\\ VS,R G «S+ with ||C|| < oo 



(Z s -Z fl )X|| = \\(Vs- VR)XQ + Q T X T (VS- VR)\\ 
< 2\\y/S-VR\\\\X\\\\Q\\ 



=> ll-Zo-Zr 



sup 

XeA4 p (R)\{0} 



( ^~^ )X|I <2||^-^11[|Q|| 
-A 
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Let now be Y G Sp. As the matrix variate square root function is locally Lipschitz, 
there exists an open neighbourhood U{Y) of Y and a constant C(Y), such that for all 
S,ReU(Y) 

\\VS-y/R\\ < C(Y) \\S-R\\ 



Hence, we have for all S,R & U (Y) 
\\Zs 



Z R \\<2C{Y)\\Q\\\\S-R\ 

~<y(Y) 



C'(Y) \\S-R\ 



i.e. Z is locally Lipschitz on <S+. At all / is locally Lipschitz because 

\\f(S) - f(R)\\ < (C'(Y) + ||C||) \\S-R\\ = K\\S-R\\ VS,Re U(Y) (4.15) 

V v ' 

= :K 

Hence, we know that there exists a non-zero stopping time T > such that there exists 
a unique 5^-valued strong solution S of ( 4.9[) for t G [0, T). If T < oo, SV either hits the 
boundary of <S+ or explodes. We show that the latter cannot happen. Fix R 6W (Y) and 
set S 1 = y. Then we from (4.15) 



\\f(Y)-f(R)\\ < 
\\f(Y)-f(R)\\ 2 < 



K(\\Y\\ 
K 2 (\\Y\ 



+ \\R\\) 

l 2 + 2||y| 



\R\ 



\R\ 



with L = max{K 2 \\R\\ 2 ,2K 2 \\R\\,K 2 }. Because of ||Y|| < 1 

||/(F) -/( J R)|| 2 <2L(1 + ||F|| 2 



<L(1 + \\Y\\ + 
h IIFII 2 



Hence, K i— > f(Y) — f(R) satisfies the linear growth condition, and so does / :Y (-> /(F). 

Thus, if T < oo, we know that St hits the boundary of 5+ the first time, i.e. St £ <S+, 
but St ^ <S+, and G 5+ for alH < T. Hence 

T = ufi> : S u i S+} 
T = inf{s : det(S s ) = 0} 

□ 



or 



In other words: A unique strong solution S of (4.9) exists as long as S stays in the 
interior of «S+. 



As a consequence, in order to show that there exists a unique strong solution S of (4.9) 
in Sp on the entire interval [0, oo) we only need to show that S t G <S+ for all t G K + . 
Hence, the next step is to give sufficient conditions that guarantee that St G 5+ for all 
t G M+. First, we do this for the case with a zero drift, K — 0. Later, we can generalize 
our results for K ^ using a Girsanov transformation. 

Theorem 4.12. For every initial value s$ G Sp there exists a stopping time T > 
and a unique strong solution of the Wishart SDE (4-9) on [0, T). Suppose T < oo. T/ien 
i/iene exists a unique strong solution (S t )te[o,T] of the Wishart SDE (4-9) on [0,T) stzc/i 
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that S t G S+ for all t G [0,T) and S T G 8S+ = S+\S+ . Then, for every x G W with 
x T Q T Qx = 1 the process (x T S t x) t( z[ 0j T] is a square Bessel process with parameter a and 
initial value x^SqX, (i t Stx)t^[o,T] ~ 

BESQ(« ) . Moreover, if a > 2 then the process 

(x T S t x) t £[o t T] remains positive almost surely. If furthermore Q G GL(p) then: 

For every y G W, y ^ 0, it holds that y T S t y > for all t G [0, T] a.s. 



Proof. We get the required solution (S t )t e[o,T \ of (4.9) by Theorem 4.11 if we attach the 
one point St to the solution on [0, T) of (4.9). 

Let x G W be an arbitrary vector with x T Q 1 Qx = 1. The matrix of all partial derivatives 
of the map Ai p {R) E5 S x T Sx G E is equal to 

D(x T Sx) = 

and hence all second derivatives are zero. 
With Ito's formula (Theorem 3.27) we get 



{XiXjjij 



XX 



d(x T S t x) 



tr(xx T dSt) 

tr{Qxx T ^/S~ t dB t ) + tr( y/S~ t xx T Q T dBj) + tr(xx T aQ T Q dt) 
2ti(Qxx I ^/S~ t dB t ) + ti (xx T aQ T Qdt) 
2y/tr(S t xx T Q T Qxx T )dp t + atr(x T Q T Qx) dt 
2 ^/tr(S t xx T ) d(3 t + adt 
2 \j x T S t x dfit + adt 



where we used Lemma 4.9 Hence, (x Stx)te[o,T] ~ BESQ(a, x s x 



If a > 2 we know from Theorem 



4.5 



that the process (x T S t x) t £{o,T] is strictly positive for 
all t G [0,T] a.s., because the initial value is positive, x T sox > 0. 

Now suppose that Q G GL(p). Let y G M. p ,y ^ 0, then for x := (y T Q T Qy)~zy it holds 
that x T Q T Qx = 1 and by the above that x T S t x > for all t G [0, T] a.s. Thus, we also 
have y T S t y > for all t G [0, T] a.s. □ 

First it may sound surprisingly that for fixed y ^ 0, the process (y T S t y)te[o,T] is almost 
surely positive at T, y T STy > a.s., even though that there exists an z G W, z ^ 0, such 
that z t Stz = a.s. But this just tells us, that it is 'unlikely' to find such a vector z in 
advance. Before we continue with the next theorem, we state 

Theorem 4.13. Let S ~ WV P {Q, 0, a, s ). Then, for £ ^ 

d(det(S t )) = 2 det(S t )Jtr(Q T QSr l )d/3 t + det(S t )(a + 1 - p)tT(Q T QS^) dt (4.16) 



d(det(S t Y) = 2£det(S^ 



tv(Q T QSr 1 ) d/3 t + tr(Q T Q5,- 1 )( 



a — 1 — p 



d(\n(det(S t ))) = 2Jtr(Q T QSr 1 ) dfr + 



a 



V 



l)tr(Q T QSt l )dt 



d(ln(det(£ t ))) = 2Jtr(Q T QSr 1 ) dfc for a = p + 1 



(4.17) 
(4.18) 
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for t G [0, T) with T = inf{s : det(S' s ) = 0}, where (3 is a one- dimensional Brownian 
motion. 



These results can also be found in Bru 1991 , p. 747] without proof. 
Proof. We consider the SDE 

dS t = ^S t dB t Q + Q T dBj ^S t + aQ T Qdt, S = s 



First, we prove Equation (4.16): 



According to Ito's formula, we have 



1 p 

d(det{S t )) = tr(£>(det(^)) dS t ) + - J] 



3 2 



2 dSt a 3 S+ hi 

ij,k,l=l ' J ' 



det(St) d[Sij, S, 



kilt 



Using Lemma [2. 6| we get 

tr(L>(det(£)) dS t ) = det(5 , t )tr(5 , t " 1 dS t ) 

= det(5 t )tr(^ 1 ( v / ^^g + Q T dBjy/S~ t + aQ T Qdt)) 

= det(S t )[tr(QSP dB t ) + tr{S^Q T dBj) + atv(Q T QSt 1 )] 

= det(5 t )[2 V / tr(g T Q,S t - 1 ) d(3 t + atr(Q T QS^)} 



where we used Lemma |4.9| in the last equation. For the second order term, we get 
p 

2 



1 p 



d 2 



i,j,k,l=l 
P 



3S t>ij dS t Ml 



det(St) d[Sij, S k , 



kl\t 



">kl\t 



\ J2 det(S)[(Si%(Si%-(Sr% k (Si%)d[S i:i ,S k 

i j',fc,i=l 
1 P 

- detiS^S-'MS- 1 )^ - (Si 1 U(S-\][S t , k (Q T Q) jl dt + S t>u (Q T Q) jk dt 

+S t ,jk(Q T Qh dt + S t ,AQ T Q)ik dt] 



i j',fc,i=l 



det(S t )[(l-p)tT(QQ L S^)dt] 



where we used Lemma 4.8 and Lemma 2.6 , again. At all, we get equation (4.16) 



d(det(S t )) =2det(S t )Jti(Q T QSi 1 ) d(3 t + det(S t )(a + 1 - p)tr(Q' L 'QS^) dt 



For Equation (4.17), we observe that 



If we set 



dXl = ZXt 1 dX t + ^ - l)Xt 2 d[X, X] t 



X t := det(S t 



(4.19) 
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then (4.16) is equal to 



and 



dX t = 2X t [ JtriQTQSr 1 ) dfr + - + I P tr(Q T QS- 1 ) dt 



d[X, X] t = 4X 2 tr(Q T QSr 1 ) dt 



If we insert (4.20) into (4.19) we get (4.17): 



dXf = ZXt^Xt (yftriQTQSr 1 ) d(3 t + a + l 2 P ti(Q T QS- 1 ) dt 



2^ 



(4.20) 



txiQTQSr 1 ) dfo + a+ l P tr(Q T QS- 1 ) dt + (£- l)tv(Q T QS- 1 ) dt 



tr(Q T QS t - 1 )dA + tr(Q 1 QS', 



T / a — 1 — p 



+ ndt 



To prove the last equation, observe that 



d(\n(X t )) = X7 1 dX t - -X- 2 d[X, X] t 



(4.21) 



and insert (4.20) into (4.21) 



d(\n(X t )) = X^HXt ( JtiiQTQSr 1 ) d(3 t + a+ l P ti{Q T QS7 1 ) dt 



- X~ 2 4X?tr {Q T QS7 1 ) dt 



2 I JtiiQtQSr 1 ) d/3 t + 



a + l -p / n Tn0-i^ 



tr(Q l QS7 L ) dt ) - 2tr(Q T QS7 1 ) dt 



2Jtr(QTQSf l ) d/3 t + (a + l-p- 2)tr(Q T QS7 1 ) dt 



2\ tr(Q T QS7 1 ) d/3 t + (a - p - l)tx(Q T QS^) dt 



which equals (4.18), if a = p + 1. 



□ 



Theorem 4.14 (Existence and Uniqueness of the Wishart Process II). 

Let K = 0, So G Stand a > p + 1. JTien t/iere exists a unique strong solution in 5+ o/ 

tae Wzs/jart S'DE on [0, oo). 



Proof. We consider the SDE 

dS t = y^tdBtQ + Q T dBj^/S~ t + aQ T Qdt, S = s 

and show that 

T = inf{s : det(S s ) = 0} = oo 
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where we adopt the idea from Bru 1991, p. 734] to use McKean's argument. As a matrix 
norm we choose 

v 

\A\ \ := max VlA-fcl (4.22) 



: max > \Ajk\ 

J = l 

Observe that then |tr(A)| < p||^4|| for every matrix A G A / l p ( 
Let us assume that T < oo. 



First consider the case a = p + 1. Then we have from (4.18) 



d(\n(det(S t ))) = 2^tr(Q T QSr 1 )d/3 t 

We can define an increasing sequence of stopping times (T n ) ng N with 

T n := inf{t Gl + : \\S^\ \ = n}, 

where T n < oo because T < oo, that converges to T such that ln(det(S , min { ti T' n })) is a 
martingale: 



E 



2Jtr(Q T QS7 1 : 



4£( / tT(Q T QS^)dt 



o 



< 4E(J j9||Q T Q|Mt 
= 47>||Q T Q||n<oc 

where we used that 

o<tr(Q T Q^- 1 ) ^pHqtq^- 1 !! ^plig^HH^- 1 !! <p||Q T Q||n 

for t G [0, T n ). That means by definition that ln(det(S t )) is a local martingale on [0, T). 



Now we can apply McKean's argument, Theorem 3.44, with r t = det (St) and h = In. By 
assumption we know det (So) > and we have shown above that h{r t ) = ln(det(Sf)) is a 
local martingale on [0, T). Obviously, ln(det(St)) converges to -oo for t — > T and hence 
McKean's argument implies T = oo. That is a contradiction to our assumption. Logically 
consistent, we can conclude T = oo. 



In the case a > p + 1, we set £ 



p+l—a 



< and 



T n := inf{t G R+ : {{S^W = n} A inf{t G M+ : det(S f _1 ) > n} 
where a Ab := inf{a, b}. Again, (T n )„ eN is a sequence of stopping times that converges to 



T. From (4.17) we know that 



d(det{Stf) = 2£det(St)Vt I .(Q T QSr 1 ) ( iA 
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We show again that det(StAT n )^ is a martingale for every n e N: 

rT n / , \ 2 \ / „T, 



E UT " ^det(5 i )ytr(Q T Q5 t - 1 )^) <2tj = 4£ 2 £ Qf " det^^tr^QS^ 1 ) 

< 4£ 2 £^ V 2? p||Q T Q||ncft) 

= 4eT nP \\Q T QWn 1 -^ <oc 

Hence, we can apply McKean's argument for the local martingale det(St)^ on [0,T), 
because det(Sf)^ converges to infinity for t — > T. The same reasoning as above implies the 
contradiction T = oo. 



Finally, Theorem 4.11 proves the statement. 

Observe that, for a < p + 1, we have £ > and thus det(St)^ does converge to zero, so 
we cannot apply McKean's argument in this case. □ 

Eventually, we are able to state the final theorem about the existence and uniqueness 
of the Wishart process, which is the main achievement of this section. 

Theorem 4.15 (Existence and Uniqueness of the Wishart Process III). 

Let Q E GL(p), K e M P (R), s e <S+ and the parameter a > p + 1. For B ~ BM P 

consider the stochastic differential equation 



dS t = V StdBtQ + Q'dBt V S t + aQ L Qdt, S = s (4.23) 



From Theorem 4-14 we know that there exists an unique strong solution (S,B) of (4-23) 
on [0,T S ) with T s = inf{s : det(5' s ) = 0} = oo. Define the process 

Uf^-y/ftKQ- 1 

and suppose that 

^ ftrf- fuJdB,))) (4.24) 
\ \ Jo ///te[o,oo) 

is a martingale . 

Then there exists an unique strong solution (S,B) = ((S t )teR + , (Bt)tm+)> B ~ BA4 P , in 
the cone of all positive definite matrices <Sj" of the Wishart SDE 



dS t = \/S t dB t Q + Q T dBj\/S t + (S t K + K T S t + a Q T Q) dt, S = s (4.25) 
on the entire interval [0,oo). 

Proof. In this proof, with solution we always mean an 5^-valued solution. 



With Girsanov's Theorem (Theorem 3.41) we are able to conclude that 



L\ := f U,dt + B t = - [ sjl^sKQ- 1 dt + B~ t (4.26) 
Jo Jo 
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defines a Brownian motion w.r.t. the equivalentprobability measure Q as defined in (3.15). 



Some calculation shows that 



dSt 



StdBtQ + Q'dBt ^JSt + aQ'Qdt 



S t {dB t + y StKQ' 1 dt)Q + Q l dB t \JS t + aQ l Qdt 

T 



S t dB t Q + Q T dB t ^S t + S t K dt + aQ T Q dt 
S t dB t Q + Q T dBjJJ t + {S t K + K T S t + aQ T Q) dt 



(4.27) 



Hence {S, B) is a solution of (4.25) on [0, oo 



From Theorem 4.11 we know that there exists a unique strong solution S of (4.27) on 
the interval [0, T s ) with T s = inf{s : det(S' s ) = 0} > 0. The pathwise uniqueness implies 
uniqueness in law, thus S and S have the same distribution, and so ha ve T s and T s . 
Hence, T s = T s = oo and {S, B) is an unique strong solution S of (4.25) on the interval 
[0,oo). 



□ 



Remark 4.16. In the case that the matrices Q T Q and K commute, Bru t 1991, p. 748] 
has shown that (4-24) is a martingale by extending the methods of Pitman and Yor j!98^ 
for the one-dimensional case. 



Assumption 4.17. For the rest of this thesis, we will assume that (4-^4) ^ s a rnartingale. 



Before we end this chapter, we want to compare our results to the one stated in Bru 



1991 : 



Theorem 4.18. (Cf. Bru '1991, Theorem 2"]) If a e A p := {1, . . . ,p- 1}U (p- 1, +oo), 



A E GL{p), B 6 S p , so £ Sp and has all its Eigenvalues distinct, and M is a p x p- 
dimensional Brownian motion, then the stochastic differential equation 

dS t = Si dM t {A T A) I + {A T A) I dMj s} + {BS t + S t B) dt + aA T A dt, S = s (4.28) 

has a unique solution on [0, r) if B and {A T A)^ commute, whereas r denotes the first 
time of collision, i.e. the first time that two eigenvalues of S become equal. With the term 
unique solution is meant a weak solution that is unique in law. 



Compared to (4.28), our definition of the Wishart SDE (4.9) is more general, as we 



allow the drift matrix K to be an arbitrary matrix, whereas in (4.28) the matrix B has 



to be symmetric negative definite. In the case a G \p+ 1, oo), the result in Theorem 4.15 



extends the one in Theorem 4.18 because we prove the existence of a strong solution, 
that is unique up to indistinguishability and takes almost surely values in the cone of 
all symmetric positive definite matrices. Furthermore, this solution has infinite lifetime 
independently of any collision of the eigenvalues of our solution. 
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4.3. Square Ornstein-Uhlenbeck Processes and their 
Distributions 

Theorem 4.19. 

Let n G {p + l,p + 2,...}, A G GL{p), B G -M P (M) ; s e <S+ and X ^ OUV n , p {A, B,x ) 
be an Ornstein-Uhlenbeck process with XqXq = sq. Then there exists a Brownian motion 
M ~ BAip such that the unique strong solution (S, M) in 5+ of the stochastic differential 
equation 

dS t = S? dM t {A T A) I + {A T A) I dMj s\ + {B T S t + S t B) dt + nA T A dt, S = s (4.29) 
is given by the square Ornstein-Uhlenbeck process S = X T X = (XjX t ) te ^ + . 



Note that the class o f sto chastic differential equations of the form (4.29) is exactly the 
class of Wishart SDEs (Q with Q = \J A T A G «S+, K = B G M P {R) and 



a = n 



G {p + l,p + 2, . . .}. Thus, everything we have established in the foregoing section 



is still valid for the subclass of SDEs (4.29). 
Proof. We define 



S t :=XjX t VtG 



and 



■= [ ^/S^X] dW s A(^A?A)- 1 g M P (R) VtG 
Jo 



The matrix square root y/A T A is positive definite and therefore invertible. Now we show 
that M is a Brownian motion. Because of 

E (j\^XU{A T A)-^{^X^A{A?A)^)ds 

= e(J {A T A)-^A T X s S; 1 XjA{A T A)- 1 ^ds 

= [ {A T A)-^A T A{A T A)-^ ds 
Jo 



tl p < oo a.s. 



M t is a local martingale. Furthermore, observe that 

dM m = J2(\f^ X ^rndW t!mn (A(^A)- 1 



n-J 



and 

m,n 

= (^ 1 X t Xj^% k ((V^r l A T A(VA^A)- l ) jl 

= (Ip)ik(Ip)jldt 

= l{i=k}l{i=i} dt 
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and zero otherwise 



where we used that 

d[W mn , W m > jn >]t = dt m = m, n = n , 

With Theorem E32] we con conclude that M is a Brownian motion. 

Finally, using the partial integration formula shows us that our pair (5", M) is a solution 

of the stochastic differential equation (4.29). 

dS t = d(X?X t ) = (dX t ) T X t + Xj(dX t ) + d[X T ,X}M 

= (B T Xj dt + A T dWj) X t + Xj (X t B dt + dW t A) + A T d[W?, W t ]f A 

= B T S t dt + A T dW t X t + S t B dt + Xj dW t A + A T d[W?, W t ]f A 

= (B T S t + S t B) dt + A T dW t X t + Xj dW t A + A T d[nI p t} M A 

= (B T S t + S t B) dt + {A T A) s dhlj Sf + Sf dM t {A T A) I + nA T A dt 

= S^ dM t {A T A) 3 + (A T A) 3 dMj s} + {B T S t + S t B) dt + nA T A dt 

where we used that 

n 

d[W?, W t }^ = £ d[W tM , W t , kj \ t = nl {l=3] dt 



k=l 



So far we have shown that (S,M) is a weak solution of (4.29). From Theorem 4.15 we 
know that pathwise uniqueness holds for (4.29), and thus by Remark 3.33 (ii) we have 
that (S,M) is also a strong solution of (4.29). □ 

Theorem 4.20 (Conditional Distribution of the Square Ornstein-Uhlenbeck Process). 
Let n G {p+l,p + 2 ,. ..}, A G GL(p), s G S+ and B G M P (R) with £ -a(B) - a(B). 
The solution of (4-29) has the conditional distribution 

S t \s ~W p (n, E^E^e^V*') 



with 



(4.30) 
(4.31) 



E t = A~\A T A) - A-\e BTt A T Ae Bt ) 
where A~ l is the inverse of A : S p — >• <S P ,X h-> — B T X — XB. 

Proof. The solution S = X T X is given by a square Ornstein-Uhlenbeck process. Theorem 
[3T491 shows that 

X t \x ~ A/" niP (x e B ', 4 <g) E t ) 

with 



From Lemma [3. 131 we know 



XTlxo 



B T tT 



x ; 



Using Lemma 3.20 we achieve 



-l„S T t T 



i.e. 



□ 
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Theorem 4.21 (Stationary Distribution of the Square Ornstein-Uhlenbeck Process). 
Letn G {p+l,p+2,.. .}, A G GL(p), s G <S+ and B G M P (R) with Re(a(B)) C (-00, 0). 
Then the solution of (4.29) has a stationary limiting distribution, that is 

where A~ l is the inverse of A : S p — >■ S P ,X h> — P t X — XB. 



(4.32) 



Proof. From (4.30) and Theorem 3.19 we know for the solution S of (4.29) that St given 
So has characteristic function 



P 5 ' = det(J p - 2iL t Z)- 2etr[iG t (I p - 2z£ t Z)- 1 £ t Z] 



(4.33) 



with 
and 



E t = A- x {A T A)-A- 



T,- l e BTt s e Bt 



From the proof of Theorem 3.50 we know that 

Re{a{B)) C (-00, 0) => lira exp(Pt) = 

and 



t— >oo 



Thus 



and 



lim Et = 

t— i-OO 



lim t = 

t— >oo 



lim P s < = det(/ p - 2z^- x (AM) Z)"* =: /(Z) 



t— >oo 



With Levy's Continuity Theorem it exist a probability measure /z such that ju(Z) 
/(Z) for all Z G A^ P (M) and P 5 * w —> k \i. According to Remark 



3.17 



and Theorem 

3.19[ / is the characteristic function of a random matrix with central Wishart distri- 



bution W p (n,A~ 1 (A T A),Q). Hence, S has limit distribution W p (n,A~ 1 (A T A), 0) regard- 
less of any initial value Sq. As S is a Markov process (Theorem 3.39) we conclude that 
W p (n,A~ 1 (A T A),0) is its stationary distribution. □ 

In the case where B G <S~, and the matrices A T A and B commute, it holds that 
A~ 1 (A T A) = ~lA T AB~ 1 and the stationary limiting distribution is WJn, -\A T AB~ l , 0). 



43 



4. Wishart Processes 



4.4. Simulation of Wishart Processes 

Recall the stochastic differential equation of the Wishart process, that is 

dS s = ^/S~ s dB s Q + Q T dBj ^/S~ s + (S S K + K T S S + aQ T Q) ds, S = s 
For every t e R+, h > integration over the interval [t,t + h] yields 



S, 



t+h 



s t - 



t+h 



S s dB s Q- 



t+h 



Q T dBj a/ S, 



t+h rt+h 

S s Kds+ I K T S s ds+aQ T Qh 
t Jt 

(4.34) 

We now try to approximate the stochastic integral above to make the Wishart process 
suitable for numerical simulation. An easy way of doing this, is the Euler-Maruyama 



method (see Kloeden and Platen 1999, p. 340]): 



St+h = S t + \JS t {B t+h - B t )Q + Q L (B t L +h - Bl)yJS t + (S t K + K L S t + aQ Q)h (4.35) 

We call S the discretized Wishart process. As the Brownian motion has stationary, in- 
dependent increments, we know that the distribution of B t +h — B t is Af p (0, hl 2p ) and is 



independent of all previous increments. Hence, we can use (4.35) to simulate S for any 



fixed step size h > 0. Then we get a process on the mesh (0, h, 2h, . . . , T) for any T 6 
that is (So, S h , S 2 h, ■ ■ ■ , St). 



However, even under the Assumptions of Theorem |4.15[ this discretized Wishart process 
can become negative definite such that we have to stop our simulation before we reach T, 



because the square root in (4.35) is not well-defined anymore. 



To solve this , we observe that S — )■ S for h — > 0. Thus, we can expect S to remain positive 
semidefinite as long as h is sufficiently small. Hence, we introduce a variable step size to 
our algorithm: 

Suppose we have already given the discretized Wishart process (So, Sh, S 2 h, ■ ■ ■ , S t ) and 
the discretized Brownian motion (0, B h , B 2 h, ■ ■ ■ , B t ) up to a tim e t < T — h. Suppose fur- 
ther that we calculate S t+ h (and thus B t+h ) according to (4.35) and that S t +h is negative 



definite, i.e. (at least) its smallest eigenvalue becomes negative. Then, we cut our step size 
by half, calculate S t+ h and check again if the smallest eigenvalue of S t+ h is negative. We 

continue this iteratively, until (hopefully) S t+ _h_ is positive semidefinite or, the step size 
falls under a certain value, say ~ 2.2 x 10~ 16 (that is the constant eps in MATLAB). In 
the last case, the step size converges to zero. 

In order to get the value S t+ h, we need to draw B t+ h conditionally on the given values 
B t ,B t+ h). Because of the Markov property of the Brownian motion (see 

t+ h conditionally on (B t , B t+h ). For now, we 



(0, B h , B 2h 
Theorem 



3.39), this is the same as drawing B t 



only consider the i,j — th entry B^ of B. From Glasserman 2004, p. 84] we know that 



B tlit+ n\(B^ = x,B^ +h = x + y) £ J ^ ^4 



h 

i Vh 

2" ' ~2 




X + + -^~ Z ij 
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4.4. Simulation of Wishart Processes 



where Zij ~ jVi(0, 1) and = denotes distributional equivalence. Written in matrix nota- 
tion, we get for the increment of B 



(4.36) 



(B t+h - B t )\(B t = X, B t+h = X + Y) = \y + 



where Z ~ Af PtP (0, 1 2p ). A sample implementation can be found in the appendix. Now we 
give a few examples. 

Example 1. First we begin with the one-dimensional square Bessel process, i.e. a solution 
of 

dX t = 2JXtdp t + adt, X = x := 0.5 



where (3 denotes a one- dimensional Brownian motion. From Theorem \4-5\ we know that this 
process never becomes negative for any choice of a, Xq > and remains positive for xq > 
and a > 2. Here are two sample paths over the interval [0, 1] and initial step size h = 10~ 3 : 

alpha=0.5 alpha=2 





In the case a = 0.5, the algorithm had to reduce the initial step size to 1.5625 ■ 10~ 5 at 
at least one point in order to guarantee non- negativity, whereas for a = 2 this was not 
necessary. Observe that in the case a = 0.5 < 2, the square Bessel process can become 
zero, but gets reflected instantaneously. 

Example 2. Our next example is the 2-dimensional Wishart process 

dS t = ^S t dB t Q + Q T dBj y/S~ t + (S t K + K T S t + aQ T Q) dt, S = s 
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4. Wishart Processes 



with a 2- dimensional Brownian motion B, a = 3, Q = ( J -3 ), K = —AI2 and so = 
We show sample paths for the three different entries of S, that are Sn, S22 and Su, and 
for q2 S ^2 > what would be the correlation in a stochastic volatility model. 



S11 



S22 





S12 



S12/(S11 2 +S22 2 ) 





The algorithm had to reduce the initial step size of h = 10~ 3 by half at some points. We 
have 

Eigenvalues of S 



18 r 
16 
14 
12 h 
10 

8 

6 

4 

2 





I If 



■ ...... V/V',,!^/ I . 



II,/ 

N I f 1 



It" u 1 



0.2 



0.4 



0.6 



0.8 
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4.4. Simulation of Wishart Processes 



for the eigenvalues of S. 



Example 3. The last example are two 5 -dimensional solutions of the stochastic differen- 
tial equation 



dS t 



S t dB t + dBl y/S t + (-AS t + «;) dt, S = 0.1 • I 5 , i = 1, 2 



(4.37) 



For the first one, we have a\ = 7.2, and the smallest and the largest eigenvalue of S look 
like 



smallest eigenvalue 




largest eigenvalue 




In the second case, a 2 = 3.5, we don't know if there exists any solution of (4-37) for 
i = 2. Thus, we do not know whether our simulated process corresponds to a solution of 



(4- 31) for i = 2, and if it does, it does not have to be a Wishart process by Definition 4- ? 



(because we demand the Wishart process to be a strong solution). It may be noted, that the 
algorithm had to reduce the initial step size of h = 10~ 3 to 7.8125 • 10 -6 at some points. 
Again, we shall have a look at the eigenvalues of our simulated process: 
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smallest eigenvalue 
0.12 i 1 1 .— 




largest eigenvalue 
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5. Financial Applications 



We first focus on two well-known financial models that are based on the one-dimensional 
Wishart processes, that is in fact a generalized squared Bessel process. Later, we consider 
the actual matrix variate case. 



5.1. CIR Model 



According to Theorem 4.15 we get for p — 1, q > 0, fcGl, a>p + l — 2, Sq > and an 
one-dimensional Brownian motion B that the stochastic differential equation 



dS t = 2qy/S t dB t + (2kSt 



aq 2 ) dt, 



So = s 



has a unique, strong and positive solution in (0, oo). For k < and the new parametriza- 

2 

tion a = 2q, a = —2k and b = -^jj- the stochastic differential equation gets the form 

dS t = ay^dBt + aib- S t )dt (5.1) 



that is the stochastic differential equation of the CIR process. It is mean reverting as 
a > with b > =f. 

p. 122] has shown, S t \so is distributed as - ( 1 ~^ c P(~ af )) times a 
noncentral chi-square random variable with ^ degrees of freedom and noncentrality 



As 



Glasserman 



2004 



4a exp(— at) 
<t 2 (1— exp(— at)) 



S - 



parameter 

From now on we follow Gourieroux 2007, p.l84f.]. Let's assume that the interest rate r 
follows a stochastic differential equation of the form ( 5.1[ ), 

dr t = <7y/r~ t dW t + a(b-r t )dt (5.2) 

where W is a Brownian motion under a risk-neutral probability measure Q. Then, the 
price of a zero-coupon bond at time t with time to maturity h is given by 

nt+h 



B(t,t + h) = E, 



Q 



exp 



7v dr 



(5.3) 



where Ef denotes the conditional expectation E®[-\cr{r s : s < t}] under the measure Q. 



Then Cox et al. 1985 have shown that 

B(t,t + h)=exp(-f(h)r t -g(h)} 



with functions 



f(h) 
9(h) 



Ac 



c + a (c + a) {{c + a) exp(ch) + c — a] 
ab(c + a)h 2ab f (c + a) exp(c/i) + c 



In 



a- 1 



2c 
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5. Financial Applications 



with c := \Ja 2 + 2a 2 . 

Hence, we have a closed form solution for B(t,t + h) that is exponential affine in r t . 



5.2. Heston Model 



A process S of the form (5.1) can also be used to model the volatility in a stochastic 



volatility Black-Scholes model according to Heston 

d(ln(X t )) =fi t dt+ \Ts t dW t 



where the stock price at time t is denoted by X t . We refer to Gourieroux 2007, Chapter 
2.2.2.1 for details. 



5.3. Factor Model for Bonds 



In contrast to section 5.1 on the preceding page where the risk-free rate r followed a 



stochastic differential equation of the form (5.2), we now want to use a factor model to 



consider corporate bonds jointly with a long term government bond (e.g. T-bond). Again, 

ap 
1. 



we summarize the results of Gourieroux 2007 chapter 3.5.2.]. 

, K. Let us assume that 



Denote by A^t the default intensity for firm i, i 

r t = c + tr(CS t ) 
\,t = <U + tT(DiS t ) Vi = l,...,K 
where c, di > are nonnegative, C, Di G <S>+ and S ~ WV P (Q, K, a, sq), i.e. 

dS t = ^S t dB t Q + Q T dBj ^S t + (KS t + S t K T + aQ T Q)dt, S = s 
under the conditions of Theorem 14.151 



(5.4) 
(5.5) 

(5.6) 



Equation (5.4) is the factor representation for the risk-free rate and equation (5.5) the 



factor representation for the corporate bonds. 

Observe, that tr(CSt) > for all t. Indeed, as C G there exists an orthogonal matrix 
such that C = UDU T where D is a diagonal matrix of eigenvalues > 0. Denote by 



Ui G 



, p, the columns of U. Then we have C = Y7k=i ^kUkuJ and thus 



tr(CSt) = ^2/J, k tr(u k ulS t ) = ^2fikuJS t u k > 

k=l k=l 

With the convention Ao,t = we get for the price Bq of a zero-coupon bond and the prices 
of corporate bonds B^ the formula 

*t+h 

?Q ' 



Bi(t,t + h) = E\ 



exp 



r T + A», T dr 



Mi = K 



(5.7) 



under a risk neutral measure Q. If we insert (|5.4|) and (|5.5|) into (|5.7|) we get 
Bi(t, t + h)= exp(-/i(c + di))E? 



etr -(C+Di 



t+h 



S T dr 



Vi = 0,...,K (5.8) 



with the convention do, D = 0. Gourieroux |2007 shows that there exists a closed form 
expression for (5.8). 
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5.4. Matrix Variate Stochastic Volatility Models 



5.4. Matrix Variate Stochastic Volatility Models 



An application for matrix variate stochastic processes can be found in Fonseca et al. 



2008 . which model the dynamics of p risky assets by 

dX t = diag(X t )[(rl + A t ) dt + sfS t dWt\ (5.9) 

where r is a positive number, 1 = (1, . . . , 1) e IR P , At a p-dimensional stochastic process, 
interpreted as the risk premium, and Z a p-dimensional Brownian motion. The volatility 



process S is the Wishart process of (5.6). 



In contrast to a continous model of the form (1.3), another multivariate stochastic 



volatility model for the logarithmic stock price process can be given by 

dY t = (fi + S t /3) dt + S| dW t , Y = (5.10) 
where W, W denotes a p-dimensional Brownian motion and £ is given by a Levy- 



driven positive semidefmite OU type process, see Stelzer 2007 for details. In this case 



the volatility is not continuous anymore and has jumps. 

If one wants to model the volatility with a time continuous stochastic process, e.g. for 
economic reasons, one could also suggest the model 



dY t = (ji + S t P) dt + Sf dW t , Y = 



(5.11) 



where the process £ was substituted by the Wishart process of (5.6) 
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A. MATLAB Code for Simulating a 
Wishart Process 



This is an implementation in MATLAB of the algorithm described in section 4.4 on 
page 44 



function [S , eigS , timestep , minh] =Wishart (T , h , p , alpha , Q , K , s_0) 
I 

% Simulates the p-dimensional Wishart process on the interval [0,T] 
% that follows the stochastic differential equation 

% dS_t=sqrt{S_t}*dB_t*Q+Q ' *dB_t ' *sqrt{S_t}+ (S_t*K+K ' *S_t+alpha*Q ' *Q) dt 

% with initial condition S_0=s_0. 

% 

% Method of discretization: Euler-Maruyama 
% Starting step size: h 

% In order to guarantee positive semidef initeness of S, the step size will 

°/„ reduced iteratively if necessary. 

% 

% Output : 

% S is a three dimensional array of the discretized Wishart process. 
% eigS is a matrix consisting the eigenvalues of S 
% timestep is the vector of all timesteps in [0,T] 

% minh is the smallest step size used, i.e. minh=min(diff (timestep) ) 
% 

% Author: Oliver Pfaffel 
°/ Email: 0.Pfaffel@gmx.de 
% June 17, 2008 
% 

% 



horg=h; 
minh=h; 
timestep=0; 



[V_0,D_0]=eig(s_0); 
eigS=sort (diag(D_0) ' ) ; 
drif t_f ix=alpha*Q ' *Q ; 



B_old=0; 
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A. MATLAB Code for Simulating a Wishart Process 

B_inc=sqrt (h) *normrnd (0 , 1 , p , p) ; 
vola=V_0*sqrt (D_0) *V_0 ' *B_inc*Q ; 
drift=s_0*K; 

S_new = s.O+vola+vola'+Cdrift+drift'+drift.f ix)*h; 

[V_new,D_new] =eig(S_new) ; 
eigS= [eigS; sort (diag(D_new) ' )] ; 

S=cat(3,s_0,S_new) ; 
t=h; 

timestep=[timestep;t] ; 

flag=0; 

while t+h<T, 

B_old=B_old+B_inc; 

B_inc=sqrt (h) *normrnd(0 , 1 ,p,p) ; 

S_t=S_new; V_t=V_new; D_t=D_new; 

sqrtm_S_t=V_t*sqrt (D_t) *V_t ' ; 
vola=sqrtm_S_t*B_inc*Q ; 
drift=S_t*K; 

S_new = S.t+vola+vola'+Cdrift+drift'+drift.f ix)*h; 
[V_new,D_new] =eig(S_new) ; 
mineig=min(diag(D_new) ) ; 
while mineig<0, 
h=h/2; 

minh=min (minh , h) ; 
flag=l; 

if h<eps, error ('Step size converges to zero'), return, end 

B_inc=0 . 5*B_inc+sqrt (h/2) *normrnd(0, 1 ,p,p) ; 

vola=sqrtm_S_t*B_inc*Q ; 
drift=S_t*K; 

S_new = S_t+vola+vola'+(drift+drift'+drift_f ix)*h; 
mineig=min(eig(S_new) ) ; 
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end 



if flag==0, 

eigS= [eigS; sort (diag(D_new) ' )] ; 

S=cat (3 , S , S_new) ; 

t=t+h; 

timestep= [timestep;t] ; 

end 

if flag==l, 

[V_new,D_new] =eig(S_new) ; 
eigS= [eigS; sort (diag(D_new) ' )] ; 
S=cat(3,S,S_new) ; 
t=t+h; 

timestep= [timestep;t] ; 

flag=0; 

h=horg; 

end 

end 

h_end=T-t ; 
if h_end>0, 

B_inc=sqrt (h_end) *normrnd(0, 1 ,p,p) ; 

S_t=S_new; V_t=V_new; D_t=D_new; 

vola=V_t*sqrt (D_t ) *V_t ' *B_inc*Q ; 
drift=S_t*K; 

S_new = S_t+vola+vola'+(drift+drift'+drift_f ix)*h; 

S=cat (3 , S , S_new) ; 

eigS= [eigS ; sort (eig(S_new) ' )] ; 

timestep= [timestep;T] ; 

end 
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